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Problems  of  optimal  surveillance  and  contact  imbrmation  processing  are 
investigated  with  ASW  applications  in  view.  The  optimal  allocation  of  ASW 
surveillance  resources  is  investigated,  by  way  of  Bayesian  statistical  analysis, 
in  a somewhat  idealized  surveillance  situation  involving  a moving  target  and  the 
possibility  of  false  contacts.  In  the  problem  considered,  the  optimal  allocation 
of  surveillance  effort  is  shown  to  be  the  solution  of  a certain  dynamic  programming 
problem.  The  optimal  allocation  is  determined  numerically  in  a number  of  special 
cases  and  compared  to  several  simple  allocation  policies.  The  properties  of 
these  various  policies  are  investigated  through  analysis  as  well  as  through  a number 
of  numerical  examples.  Particular  attention  is  paid  to  asymptotic  behavior  of 
long  term  surveillance  policies.  One  suboptimal  policy,  the  maximum -information- 
gain  policy,  is  shown  to  have  a number  of  very  desirable  properties. 


Procedures  for  processing  ASW  contact  information  are  developed  with  two 
distinct  applications  in  mind.  The  first  application  is  directed  toward  estimating 
the  track  of  a specified  target  as  well  as  toward  drawir^  inferences  about  overall 
target  behavior  patterns  from  contact  data  on  a number  of  different  targets.  The 
approach  is  to  combine,  using  Bayesian  methods,  contact  a scenario- 

based  parametric  model  for  target  motion.  Statistical  estin  .ocedures  are 

given  for  estimating  the  track  of  a specified  target  from  conta*.  ata.  Additionally, 
methods  are  given  for  estimatii^  the  parameters  of  the  motion  model  from  contact 
data. 

The  second  application  is  processing  ASW  contact  information  in  a multi -target 
environment  where  there  is  ambiguity  in  assigning  contacts  to  targets.  Several 
Bayesian  statistical  methods  for  the  systematic  generation  and  updating  of  target 
location  predictions  in  a multi-target  environment  are  developed.  Of  the  ones 
considered.  Extended  Memory  processing  is  shown  to  be  the  only  computationally 
practical  method  that  makes  accurate  use  of  the  observational  data. 
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PREFACE 


This  is  a report  by  Daniel  H.  Wagner,  Associates  to  the  Naval  Analysis 
Division  of  the  Office  of  Naval  Research  (Code  431)  on  a research  investigation 
performed  under  ONR  Contract  No,  N00014-76-C-0676,  This  report  is  directed 
towards  developing  methodologies  for  processing  ASW  surveillance  information 
with  the  objective  of  obtaining  estimates  of  target  location.  It  is  intended  primarily 
for  use  by  analysts.  The  methods  have  been  motivated  by  actual  ASW  information 
processing  requirements.  The  goal  has  been  to  achieve  useful  and  computationally 
practical  methods  which  are  suitable  for  real-time  assistance  to  ASW  surveillance 
operations. 

We  would  like  to  express  our  appreciation  for  the  excellent  cooperation  and 
support  that  has  been  given  to  this  work  by  Mr.  J.  Randolph  Simpson  and  CDR  Ronald 
James  of  the  Naval  Analysis  Division  of  the  Office  of  Naval  Research, 

Additionally,  we  would  like  to  recognize  the  capable  efforts  of  our  colleagues 
B,  D.  Wenocur  who  programmed  the  ASWIPS  model  discussed  in  Chapter  IV,  and 
R.  V.  Kohn  and  S.  S.  Brown  who  contributed  a section  to  Appendix  A. 
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SUMMARY 


This  report  is  directed  towards  the  problem  of  providing  estimates  for  a 
submarine  target's  location  under  various  conditions.  The  objective  is  to  develop 
useful  and  computationally  practical  tools  for  obtaining  such  estimates.  The 
principal  application  is  the  continuous  long-term  localization  of  a target  or 
targets  through  the  utilization  of  surveillance  efforts.  Much  of  this  current  work 
has  been  motivated  by  the  methods,  results,  and  conjectures  contained  in 
reference  [a], 

, Chapter  I provides  a brief  introduction,  and  appendices  A and  B provide 
supplementary  material.  The  issue  of  optimal  surveillance  in  a false  contact 
environment  is  explored  in  Chapter  n.  Processing  ASW  contact  information  so 
as  to  obtain  estimates  about  the  track  of  a specific  target  as  well  as  overall 
long-term  behavior  patterns  is  the  subject  of  Chapter  III.  In  Chapter  IV,  a 
computationally  practical  method  for  processing  ASW  contact  information  in  a 
multi-target  environment  is  given.  Computaticmal  results  are  included  in 
Chapters  n and  IV.  Unfortunately  the  work  presented  in  Chapter  HI  has  not  as 
yet  progressed  to  that  stage. 

Optimal  Surveillance 

Chapter  n addresses  the  problem  of  optimal  surveillance  against  a moving 
target  in  a false  contact  environment.  Although  the  surveillance  situation  considered 
is  somewhat  idealized,  our  results  provide  practically  useful  guidelines  for  considering 
adaptive  surveillance  operations.  The  goal  of  this  research  is  to  place  a practical 
and  useful  theory  of  surveillance  on  a sound  theoretical  basis. 

The  problem  of  specifying  the  location  of  a target  as  it  arises  in  surveillance 
problems  is  substantially  different  from  the  problem  usually  considered  in  search 
theory,  i.e. , the  problem  of  detecting  the  target  with  maximim  probability.  The 
difference  is  a consequence  of  the  fact  that  search  theory  generally  assumes  that  a 
detection  will  also  provide  the  desired  localization  of  the  target.  Thus,  the 
detection  itself  is  the  issue  of  importance.  In  surveillance,  the  issue  is  localization 
and  its  maintenance  over  time  and  this  goal  may  be  achievable  with  or  without 
detections  or  contacts.  In  addition,  surveillance  also  takes  into  account  the  fact 
that  a problem  may  not  end  with  a contact  because  of  poor  localization  information . 
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The  surveillance  operation  considered  here  involves  a moving  target  located 

in  one  of  N cells,  Cj^,  C2 Cjj.  The  precise  cell  containing  the  target  is 

unknown,  but  at  the  current  time  a probability  distribution  for  the  target's  location 
has  been  established.  Thus  for  2-1,  2,  . . . , N,  let  be  the  probability  that  the 
target  is  currently  located  in  cell  Cj  . Suppose  that  our  best  estimate  for  the 
target's  location  is  that  cell  which  contains  the  target  with  highest  probability.  * 

Indeed  if  Xj^  > , all  2 k,  then  the  target  is  most  likely  to  be  in  cell  Cj^,  and  the 

probability  that  this  estimate  is  correct  is  Xj^.  We  seek  to  apply  our  surveillance 
effort  so  as  to  maximize,  at  the  end  of  the  surveillance  operations,  the  probability 
that  the  target  is  in  the  high  probability  cell. 

The  surveillance  operation  itself  is  performed  in  a sequence  of  discrete  time 
stages,  each  of  duration  A time  units,  by  a single  surveillance  sensor.  We  are 
permitted  only  K stages,  and  we  must  use  the  sensor  so  as  to  obtain  the  best 
possible  estimate  for  the  target's  position  after  these  K stages.  At  the  start  of 
the  surveillance  operation,  the  fixed  time  at  which  we  must  obtain  the  best  estimate 
of  the  target's  position,  the  horizon,  is  KA  units  into  the  future.  Our  measure  of 
effectiveness  is  the  probability  that  the  target  is  in  the  high  probability  cell  at  the 
time  of  the  horizon  KA. 

At  the  beginning  of  each  stage,  a cell  is  chosen  and  is  then  investigated  for 
the  amount  of  time  A.  The  choice  of  cell,  in  general,  depends  upon  the  number  of 
stages  remaining  in  the  operation  (i.e.,  the  amount  of  time  remaining  until  the 
horizon),  the  surveillance  capability  of  our  sensor,  our  target  motion  assumptions, 
and  the  current  target  location  probability  distribution.  The  sensor  response 
results  are  then  used  to  update  the  target  location  probability  distribution.  Target 
motion  is  assumed  to  take  place  at  the  end  of  each  stage,  and  results  in  a new 
target  location  distribution.  The  next  stage  begins  with  this  new  target  location 
probability  distribution  and  is  the  beginning  of  a surveillance  operation  involving 
one  fewer  stages.  The  phasing  of  these  various  activities  is  indicated  in  Figure  S-1. 

Observe  that  the  surveillance  operation  described  above  is  dynamic  in  that 
the  allocation  of  effort  at  each  stage  depends  upon  the  results  of  the  previous 
stage.  This  is  comparable  to,  say,  a VP  operation  where  flights  are  flown  daily. 

The  results  of  each  day's  effort  together  with  the  target  motion  assumptions  are 
then  used  to  decide  the  allocation  of  effort  for  the  next  day's  flights. 

In  order  to  illustrate  the  concepts  involved,  consider  a surveillance  operation 
performed  on  three  cells.  Figures  S-2  and  S-3  compare  the  expected  degree  of 
target  localization  provided  by  three  surveillance  plans  against  a moving  target 
in  a false  contact  environment.  Figures  S-2  and  S-3  represent  the  same  target 
motion  assumptions  and  the  same  surveillance  capability.  The  surveillance  capability 

* It  is  possible  to  consider  other  measures  of  target  localization,  for  example, 

the  probability  that  the  target  is  located  in  the  two  nighest  probability  cells. 

Such  generalizations  are  discussed  in  detail  in  Chapter  n . 
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is  defined  by  the  sensor  response  matrix  R - [rjj  ],  where  ry  is  the  probability 
of  a response  from  investigating  cell  i if  the  target  is  located  in  cell  j.  Target 
motion  is  assumed  to  be  a Markov  chain  with  the  transition  matrix  M.  In  both 
figures  the  target  is  restricted  to  one  of  three  cells.  In  Figure  S-2,  however, 
the  prior  target  location  distribution  is  assumed  to  be  (1,0,0)  whereas  in 
Figure  S-3  it  is  (.5,  .1,  .4).  The  K-stage-optimal  surveillance  plan  is  the 
optimal  policy.  The  1-stage  look-ahead  maximum -information -gain  policy 
maximizes  at  each  stage  the  expected  information  content  of  the  posterior 
probability  distribution  at  the  end  of  the  stage*.  The  highest-probability-cell 
policy  investigates  at  each  stage  the  current  highest  probability  cell.  Both 
the  1-stage  look-ahead  maximum-information-gain  policy  and  the  highest- 
probability-cell  policy  are  very  easy  to  determine  computationally. 

In  order  to  understand  these  figures,  consider  the  situation  presented  in 
Figure  S-2,  The  target  is  assumed  to  be  localized  to  a single  cell  at  the  start  of 
the  surveillance  operation  (the  initial  target  location  distribution  is  (1,0,0)), 

Suppose  first  that  no  surveillance  effort  is  applied.  Because  of  target  motion 
the  degree  of  target  localization  will  decrease  with  time.  The  no  surveillance 
curve  (the  bottom  curve  in  Figure  S-2)  represents  the  expected  degree  of  target 
localization  one  can  attain  after  a specified  number  of  stages.  The  other  curves 
similarly  represent  the  expected  amoimt  of  target  localization  one  can  expect 
after  a specified  number  of  stages  using  the  various  surveillance  plans.  The 
upper  curve  is  the  theoretical  maximmn  expected  probability  that  the  target  will 
be  localized  to  a single  cell. 

Observe  that  the  1-stage  look-ahead  maximum-information-gain  policy  performs 
almost  as  well  as  does  the  optimal  policy.  This  behavior  has  been  observed  in 
most  of  the  surveillance  situations  studied  to  date  (see  for  example  Figures  II-2 
through  II-9) , and  so  this  policy  appears  to  be  a reasonably  good  suboptimal 
surveillance  policy. 

Figures  S-2  and  S-3  also  illustrate,  quite  dramatically,  the  conflict  between 
target  motion  and  the  application  of  surveillance  effort.  In  Figure  S-2,  for 
example,  it  is  assumed  that  the  target  is  completely  localized  to  a single  cell  at 
the  start  of  the  surveillance  operation  (i.e. , the  initial  target  location  distribution 
is  (1,0,0)).  Because  of  this  complete  localization,  a surveillance  operation  lasting 
only  a few  stages  will  have  a high  probability  of  success,  even  if  no  surveillance 
effort  is  applied.  Because  of  target  motion,  however,  the  degree  of  expected 
localization  decreases  rapidly  to  the  limiting  values  indicated  in  Table  S-1.  A 
surveillance  operation  against  a moving  target  gains  little  after  a large  number 
of  stages  from  the  knowledge  that  at  the  beginning  of  the  operation  the  target  was 
perfectly  localized. 


The  information  content  of  the  probability  distribution  (Xj^,  . . . , Xj,j)  defined  in 

N-cells  is  the  nonnegative  number  Z x.  Inx.  + InN. 
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Observe  that  if  no  surveillance  effort  is  applied,  the  initial  complete  localization 
to  a single  cell  will  decay  to  a probability  of  1/3  that  the  target  is  located  in  any 
given  cell.  The  interaction  between  surveillance  and  target  motion  is  reflected 
in  the  difference  between  the  probability  of  correctly  specifying  the  target's  position 
in  the  no  surveillance  case  and  that  provided  by  the  various  surveillance  plans . 

In  Figure  S-3  the  target  location  distribution  at  the  start  of  the  surveillance 
plan  is  (.5,  . 1,  .4),  so  that  the  initial  probability  of  correctly  specifying  the  target's 
location  is  .5.  For  surveillance  operations  lasting  less  than  five  stages,  the  optimal 
surveillance  policy  and  the  maximum-expected-information-gain  policy  are  able 
to  overcome  the  effects  of  target  motion  and  slightly  improve  the  extent  of  target 
localization.  For  surveillance  plans  lasting  more  than  10  stages,  however,  the 
optimal  surveillance  plan  (and  therefore  the  other  plans  as  well)  are  unable  to 
overcome  the  effects  of  target  motion,  and  so  the  initial  localization  is  better  than 
the  expected  extent  of  target  localization  at  the  end  of  the  operation. 

An  extremely  interesting  aspect  of  Figures  S-2  and  S-3  is  the  rapidity  with 
which  the  expected  degree  of  target  localization  converges  to  a fixed  value.  Moreover, 
as  indicated  in  Table  S-1,  for  a given  surveillance  plan  the  asymptotic  expected 
target  localization  depends  only  on  the  surveillance  plan  and  not  on  the  initial 
target  location  distribution.  This  indicates  that  in  the  case  at  hand  precise  knowledge 
of  the  initial  target  location  distribution  is  unimportant  to  the  long  teim  ability 
of  a surveillance  system  to  localize  a target. 

Because  of  the  speed  with  which  the  payoffs  for  surveillance  plans  converge  to 
their  limiting  value,  this  value  represents  a good  but  simple  way  of  compaiMng 
surveillance  plans  without  introducing  a time  horizon.  The  existence  of  these  limits 
for  the  K-stage-optimal  surveillance  plans  are  established  in  Theorems  n-2  through 
II- 5. 


The  surveillance  system  described  and  illustrated  in  Figures  S-2  and  S-3  is  an 
example  of  a homogeneous  surveillance  system.  A homogeneous  surveillance  system 
is  one  in  which  the  sensor  response  matrix  is  of  the  form  R = [r^j  ] where 


j if  i / j 

U in=.)  . 

Additionally,  observe  that  the  target  motion  transition  matrix  has  the  special  form 
M [djjl  where 
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TABLE  S-1 


LIMITING  VALUES  FOR  EXPECTED  TARGET 
LOCALIZATION  PROVIDED  BY  VARIOUS  SURVEILLANCE  PLANS 


CASE  I 

CASE  n 

Optimal  Surveillance 

Policy 

.496 

.496 

Maximum-Expected- 

Information-Gain 

Policy 

.464 

. 464 

Highest-Probability- 
Cell  Policy 

.440 

.440 

No  Surveillance 

.333 

.333 

ij 
I ■ 


] 

s 

ii 
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1] 


N 


1-6 


(N-1) 

N 


if  i j 

if  i = j. 


Setting  6 = 0 yields  the  special  case  of  a stationary  target.  Note  that  the  example 
discussed  in  Figures  S-2  and  S-3  is  a homogeneous  surveillance  system  with 
A = . 1,  /i  = .01,  and  6 = .15. 

The  optimal  whereabouts  search,  introduced  by  Kadane  in  reference  [bj,  is 
the  special  case  where  p = 0 and  6 = 0.  In  this  situation  the  target  is  stationary  and  a 
sensor  response  can  occur  only  if  the  target  is  in  the  cell  being  investigated.  Thus 
a sensor  response  completely  localizes  the  target  to  a single  cell.  The  allocation 
of  surveillance  effort  in  an  optimal  whereabouts  search  with  discrete  effort  is  always 
to  deploy  the  sensor  to  the  second  highest  probability  cell.  See,  for  example, 
section  4.4  of  reference  [c].  Note  in  particular  that  the  optimal  allocation  depends 
only  on  the  current  target  location  probability  distribution  and  not  on  the  horizon. 

The  resulting  surveillance  plan  thus  yields  uniformly  optimal  probabilities  of 
localizing  the  target  for  any  possible  horizon. 

We  now  view  the  K-stage  optimal  surveillance  plan  for  the  homogeneous  sensor 
as  a generalization  of  the  optimal  whereabouts  search.  Remarkably*  shown  in 
Chapter  n,  our  preliminary  theoretical  analysis,  together  with  our  numerical  results, 
indicate  that  the  optimal  allocation  of  surveillance  effort  when  > 0 is  the  same  as 
for  the  optimal  whereabouts  search,  i.e, , when  n =0.  Thus  for  any  homogeneous 
surveillance  sensor  and  any  target  motion  matrix  of  the  specified  form,  we  conjecture 
that  the  K-stage  optimal  surveillance  plan  requires  that  we  allocate  all  our  effort 
to  the  second  highest  probability  cell.  ^ 

If  this  conjecture  can  be  proven,  it  will  have  a number  of  important  cwisequences. 
First  note  that  such  a surveillance  plan  depends  only  on  the  current  target  location 
probability  distribution  and  not  on  the  number  of  stages  in  the  operaticm  and  is 
thus  a 1-stage  surveillance  plan.  This  plan  results  in  uniformly  optimal  probabilities 
of  localizing  the  target  for  each  possible  choice  of  horizon.  Moreover,  we  feel  that 
it  is  reasonable  to  model  many  operational  situations  with  a homogeneous  sensor. 

Since  in  this  case  the  K-stage  optimal  surveillance  plan  would  give  optimal  results 
for  every  horizon,  it  has  potential  for  widespread  applications.  Finally  observe 
that  if  this  conjecture  can  be  established,  then  the  optimal  surveillance  plan  in  the 
case  at  hand  can  be  specified  for  any  number  of  cells  without  resorting  to  complicated 
computational  optimization  methods. 


^Since  this  was  written,  a counterexample  has  been  found  by  J.  R.  Weisinger. 
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A Statistical  Model  for  Processing  ASW  Contact  Information  to  Estimate  Target 
Patterns  of  Operation 

Two  major  problems  which  an  ASW  planner  must  frequently  face  in  the  presence 
of  sparse  contact  data  of  variou''  types  and  quality  are  (1)  to  obtain  an  estimate  for  the 
track  of  a specified  target,  and  (2)  to  make  inferences  about  overall  target  behavior 
patterns  on  the  basis  of  contact  data  on  several  targets.  The  objective  of  Chapter  ni 
is  to  outline  a Bayesian  method  for  obtaining  these  estimates.  Unfortunately,  this 
work  has  not  yet  been  developed  to  the  point  where  numerical  results  can  be  computed. 

The  goal  in  undertaking  such  a study  is  to  provide  ASW  information  processing 
systems  with  the  capability  of  combining  historical  data  (in  the  form  of  prior 
estimates  on  target  patterns  of  operation)  with  contact  data  on  targets  of  current 
interest.  It  is  hoped  that  this  marriage  will  provide  improved  target  location 
estimates  in  that  it  will  methodically  exploit  contact  data  on  all  previous  targets. 

Our  approach  is  based  on  a parametric  model  for  target  motion.  The  object 
is  to  use  the  available  contact  data  to  obtain  posterior  estimates  for  the  parameters 
which  describe  target  motion.  A major  consideration  here  is  the  development  of  a 
parametric  model  for  target  motion  which  is  rich  enough  to  model  real  world 
situations  but  which  is  also  computationally  tractable. 

The  approach  considered  here  is  most  applicable  in  the  case  of  transiting 
targets.  Since  the  approach  is  Bayesian,  it  requires  a general  form  for  patterns 
of  motion  characterized  by  parameters  for  which  there  are  reasonable  prior  estimates. 
These  prior  estimates  may  be  based,  for  example,  on  past  experience  or  on  certain 
operational  or  geographical  constraints.  The  Bayesian  approach,  however,  enables 
one  to  obtain  estimates  of  target  operation  patterns  in  the  presence  of  sparse  data. 

The  parametric  model  for  target  motion  is  based  on  the  notion  of  weighted 
target  scenarios.  We  postulate  the  existence  of  a finite  number  of  such  scenarios, 
or  patrol  track  plans,  which  a target  might  follow  during  a specified  phase  of  its 
mission.  Each  scenario  may  be  thought  of  as  corresponding  to  a basic  geometric 
pattern  of  target  motion. 

Each  scenario  is  characterized  by  a mean  target  track  and  corresponding  covari:ince 
matrix.  Once  a scenario  has  been  chosen  for  a given  target,  the  target  must  move 
roughly  according  to  the  mean  track  of  the  scenario.  The  target,  however,  is 
permitted  to  operate  with  some  deviation  from  the  mean  track.  For  example, 
the  target  may  move  faster  or  more  slowly  than  the  specified  mean  track,  or  it 
may  vary  its  course  along  the  mean  track.  The  extent  of  these  perturbations  in 
target  motion  and  their  correlation  are  determined  by  the  covariance  matrix 
a.ssociated  with  the  scenario. 

An  example  of  some  basic  tracks  which  might  be  used  to  define  scenarios  is 
given  in  Figure  S-4.  Additionally,  two  sample  target  tracks  drawn  from  two  different 
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target  scenarios  are  indicated  in  Figure  S-4,  In  our  model  we  deal  with  discrete 
time  and  a track  is  specified  by  giving  the  target's  location  at  times  t = 1 t . 


To  formulate  more  precisely  our  parametric  model  for  target  motion,  suppose 
that  there  are  J possible  target  scenarios,  S2,  ....  Sj.  Let  pj  be  the  probability 
that  a target  will  follow  scenario  Sj.  Additionally,  assume  that  the  conditional 
distribution  function  of  the  target's  track  Z = (Zj,  . . . , z^)  is  a multivariate 
normal  distribution  with  mean  pj  and  covariance  matrix  Note  that  is 

an  element  of  IR^^  and  that  is  a 2t  x 2t  positive  definite  symmetric  matrix. 

Now  assume  that  the  parameters  in  the  target  motion  model  described  above 
are  unknown  to  us.  By  this  is  meant,  in  particular,  that  the  following  quantities  are 
not  known: 

i)  the  vector  p = (p^,  P2,  . . . , Pj)  which  gives  the  prior  probabilities 
that  a target  will  move  according  to  a given  scenario, 

ii)  the  mean  target  paths  pj,  p2>  • • ••  Mj  of  the  scenarios,  and 

iii)  the  covariance  matrices  22^,  . . . , of  the  scenarios. 


We  do  assume,  however,  that  the  number  of  possible  target  scenarios,  J,  has 
already  been  established. 

The  object  of  our  investigations  is  now  twofold.  First,  the  contact  data  obtained 
on  a specified  target  are  used  to  obtain  a Bayesian  estimate  for  the  track  of  the 
target.  Secondly,  the  contact  data  are  processed  to  obtain  Bayesian  estimates  for 
the  parameters  p,  pj,  and  ...,  . Computational  methods  for 

obtaining  these  estimates  are  given  in  Ch^ter  in. 


ASW  Information  Processing  in  a Multi-Target  Environment 

In  Chapter  IV  we  develop  Bayesian  statistical  methods  for  processing  ASW 
information  in  a multi-target  environment.  Specifically,  suppose  that  a force  of 
submarine  targets  is  known  to  be  operating  in  a particular  ocean  area  and  that 
periodic  location  estimates  are  to  be  generated  for  each  target.  Potential  sources 
of  information  to  assist  in  target  force  localization  might  include:  ocean  characteristics 
that  affect  navigability  such  as  water  depth;  submarine  operational  characteristics 
such  as  speed  range  and  frequency  of  course  changes;  and  surveillance  information 
such  as  the  direct  observation  of  port  arrivals  and  departures  and  contacts 
generated  by  ASW  sensors.  This  localization  problem  leads  naturally  to  the 
consideration  of  ASW  information  processing  systems,  i.e.,  systems  designed 
to  accept  as  input  diverse  ASW  information  of  the  type  described  and  generate 
as  output  periodic  estimates  of  the  locations  of  the  target  submarines . 
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The  methods  developed  under  this  problem  are  applicable  to  issues  of  vulnerability 
to  surveillance  as  well  as  to  issues  of  ability  to  exercise  surveillance. 

The  principal  emphasis  in  Chapter  FV  is  on  the  development  of  Bayesian  statistical 
methods  for  the  systematic  generation  and  updating  of  target  force  location  predictions. 
It  should  be  noted  that  the  surveillance  problem  considered  is  an  intrinsically  multi- 
target problem  as  opposed  to  a composite  of  many  isolated  single-target  problems . 

This  results  . ^st  from  the  fact  that  except  for  port  departures  and  arrivals,  the 
submarine  t ts  are  treated  as  observationally  indistinguishable.  Secondly,  all 
targets  are  ,ir  ; med  to  make  potential  use  of  the  entire  operating  area  available 
to  the  force  s-  chat  individual  targets  are  not  restricted  to  operate  in  disjoint 
subregions.  Under  these  restrictions  it  will  frequently  be  the  case  that  there  is 
considerable  uncertainty  about  the  exact  identity  of  the  target  that  generated  a 
sensor  contact.  This  uncertainty  leads  to  correlation  in  estimates  of  the  locations 
of  the  targets  comprising  the  force  under  surveillance  even  though  the  underlying 
target  motions  may  be  statistically  independent.  It  is  from  this  intrinsic  multivariate 
nature  of  the  problem  that  the  information  processing  difficulties  ccnsidered  in 
Chapter  IV  arise. 

A substantial  portion  of  the  discussion  in  Chapter  IV  deals  with  specific  multi- 
target  Bayesian  information  processing  algorithms  and  their  comparative  evaluation 
based  on  numerical  examples.  A particular  such  algorithm  which  we  call  Extended 
Memory  processing  will  emerge  as  the  only  one  of  the  Bayesian  methods  we  consider 
that  both  accurately  reflects  the  localization  information  inherent  in  the  observational 
data  and  is  at  the  same  time  computationally  practical. 

Examples  IV-l  and  rV-2  of  Chapter  IV  will  show  that  even  when  there  are  as  few 
as  two  or  three  targets  involved,  Bayesian  methods  that  fail  to  account  adequately 
for  correlation  in  target  location  estimates  can  break  down  badly.  In  particular,  two 
such  methods  that  we  call  sequential  processing  and  parallel  processing  and  considered 
in  Chapter  rv  may  lead  to  complete  permutation  of  target  identities  and  other 
anomalous  behavior.  These  methods  also  lead  to  the  so-called  quiet  target  problem 
that  occurs  when  one  of  two  targets  involved  in  an  identity  confusion  is  substantially 
more  quiet  than  the  other.  In  such  a situation  Bayesian  processing  methods  that 
treat  target  locations  as  independent  tend  to  react  to  a stream  of  contacts  on  the 
noisier  target  by  concentrating  the  target  location  distributicais  of  both  targets  in 
essentially  the  same  location.  As  a result,  valid  representation  of  the  position 
distribution  of  the  quieter  target  is  completely  lost.  Examples  IV-l  and  rV-2  show 
that  the  Extended  Memory  method  solves  the  quiet  target  problem.  The  theoretical 
development  of  Extended  Memory  is  given  in  Appendix  B. 

In  order  to  provide  the  Bayesian  information  processing  methods  with  a framework 
to  be  implemented,  tested,  and  evaluated,  we  have  developed  a small-scale  Information 
processing  system  described  in  Chapter  IV  called  ASWIPS  (ASW  Information  Processing 
System)  on  a Prime  400  mini-iComputer. 
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The  processing  system  ASWIPS  is  currently  configured  to  handle  up  to  three 
targets  in  a discrete  10  x 10  cellular  grid.  Time  is  discretized  into  information 
processin'i;  update  stages.  Target  motion  is  taken  to  be  a symmetric  random  walk 
with  a transition  from  the  current  cell  to  any  one  of  the  adjacent  cells  equally 
likely.  The  output  of  ASWIPS  at  the  end  of  each  iqpdate  stage  cmsists  of  estimates 
of  the  current  locations  of  all  targets  in  terms  of  probability  distributions . These 
target  location  distributions  can  then  be  compared  with  the  actual  target  tracks  used 
to  generate  the  simulated  contact  data  input  to  ASWIPS.  Such  comparisons  then 
provide  the  basis  for  the  evaluation  of  the  predictive  capability  of  various  processing 
approaches.  In  particular,  the  calculations  involved  in  Examples  IV- 1 and  IV-2  and 
the  conclusions  these  examples  support  are  based  on  ASWIPS. 
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INFORMATION  PROCESSING  TO  MAINTAIN 
LOCALIZATION  IN  ASW  SURVEILLANCE 
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CJLA.p  t ER  I 
INTRODUCTION 


The  purpose  of  this  report  is  to  develop  methods  for  processing  ASW 
surveillance  information  so  as  to  obtain  localization  estimates  on  submarine 
targets  on  a continuing  basis.  Such  methods  need  be  obtained  as  useful  and 
computationally  practical  tools.  Potential  applications  include  ASW  information 
processing  of  surveillance  and  intelligence  data,  with  an  objective  of  providing 
improved  tactical  ASW  mission  planning.  This  report  is  intended  primarily 
for  use  by  analysts. 

A predecessor  report,  reference  (aj,  established  much  of  the  framework 
for  subsequent  investigations  in  ASW  surveillance  information  processing.  This 
earlier  report  attacked  two  issues  at  the  core  of  ASW  information  processing. 

The  first  issue  concerns  a collection  of  fixed  sensors,  in  a false  contact  environment, 
with  known  detection  capabilities.  The  problem  was  to  obtain  an  estimate  of  target 
position  on  the  basis  of  sensor  contact  information  (both  positive  and  negative).  The 
approach  taken  was  to  determine  weighted  scenarios  for  target  motion  and  to  compute 
posterior  target  location  distributions  and  scenario  weights  on  the  basis  of  the 
contact  information.  Much  of  the  work  contained  in  Chapter  n of  this  present 
report  was  motivated  by  this  earlier  reference. 

The  second  issue  concerned  a movable  sensor  with  known  detection  capability 
operating  in  a false  contact  environment.  The  problem  was  to  allocate  the  sensor  so 
as  to  serve  certain  tactically  useful  purposes,  e.g. , to  localize  the  target  to  a 
specified  number  of  cells.  This  issue  was  examined  in  an  exploratory  way  using 
Monte  Carlo  techniques.  On  the  basis  of  this  analysis  a number  of  important 
conjectures  were  formulated. 

The  first  such  conjecture  concerns  a concrete  connection  between  information 
theory  and  search  theory,  i.e.,  that  the  optima^  detection  search  extracts  information 
from  the  target  location  probability  distribution  at  the  maximum  possible  rate.  Such 
an  allocation  maximizes  the  entropy  of  the  posterior  distribution  given  failure  to 
detect.  This  conjecture  was  established  in  reference  [d]. 
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In  contrast  it  was  also  conjectured  in  reference  [a]  that  the  opposite 
connection  exists  between  information  theory  and  surveillance  theory,  i,e.,  that 
a reasonable  surveillance  plan  is  to  allocate  surveillance  effort  so  as  to  place 
information  into  the  target  location  probability  distribution  at  the  maximum 
possible  rate.  Such  an  allocation  minimizes  the  expected  entropy  of  the  posterior 
distribution.  (Much  additional  numerical  evidence  to  support  this  conjecture  is 
given  in  Figures  II-2  through  II- 9.) 

To  understand  this  difference  in  allocation  of  effort  it  is  necessary  to 
indicate  the  relationship  between  search  and  surveillance.  Search  and  surveillance 
are  closely  related  but  essentially  different  acitivites  with  different  goals  and 
correspondingly  different  methods  of  achieving  these  goals.  The  principal  objective 
of  search  is  to  obtain  a target  detection.  Moreover,  search  theory  generally  assumes 
that  the  detection  also  provides  the  desired  target  localization.  In  contrast, 
surveillance  is  concerned  with  localization  and  its  maintenance  over  time,  and  this 
goal  can  be  achieved  with  or  without  detections  or  contacts.  In  particular,  a 
surveillance  problem  may  not  end  with  a contact  because  of  poor  localization 
information. 


Chapter  II  is  an  investigation  into  the  optimal  allocation  of  ASW  surveillance 
resources  in  a somewhat  idealized  surveillance  setting.  The  analysis  is  intended 
to  establish  a practical  and  useful  theory  of  surveillance  on  a sound  theoretical 
basis.  The  problem  considered  involves  a movable  sensor  which  is  deployed  in 
a sequential  fashion  against  a single  stochastically  moving  target  in  a false  contact 
environment.  Among  our  results,  we  establish  that  a good  suboptimal  allocation 
of  surveillance  effort  is  to  maximize  incrementally  the  expected  information  gained 
in  the  posterior  target  location  distribution.  This  surveillance  policy  is  called 
the  maximum-information-gain  plan.  The  advantage  of  the  maximum-information- 
gain  plan  is  that  it  is  easily  computed  in  an  incremental  manner  and  does  not 
depend  on  the  time  horizon.  Finding  the  optimal  plan,  on  the  other  hand,  requires 
the  use  of  techniques  such  as  dynamic  programming  and  quickly  becomes  impractical 
for  target  distributions  with  large  numbers  of  cells  (say  10  or  more). 


The  objective  in  Chapter  HI  is  to  develop  a Bayesian  method  for  processing 
surveillance  contact  information  so  as  to  obtain  probability  estimates  for  a single 
target's  track  and  to  combine  contact  data  on  a number  of  targets  to  estimate 
general  patterns  of  operation.  Our  approach  is  to  develop  a Bayesian  method 
for  combining  ASW  contact  information  with  a scenario-based  parametric  model 
for  target  motion.  We  assume  that,  based  on  past  experience  and  general 
operational  considerations,  we  can  specify  a finite  number  of  general  operating 
plans  or  basic  tracks  called  scenarios,  which  could  be  followed  by  a target. 

These  scenarios  may  be  thought  of  as  corresponding  to  the  basic  geometric 
patterns  of  target  motion.  It  may  be  assumed  that  these  basic  patterns  are 
selected  for  each  target  in  random  fashion  and  then  specified  for  the  target  in 
its  operations  order.  Once  the  basic  scenario  for  the  target  has  been  specified, 


the  target  then  "chooses"  its  own  particular  variation.  For  example,  the  target 
may  move  locally  faster  or  more  slowly  than  the  basic  scenario  or  it  may  vary 
its  track  about  the  basic  scenario. 

Bayesian  methods  have  been  extensively  used  in  developing  real-time  computer 
programs  to  produce  a sequence  of  updated  probability  distributions  for  target 
location.  The  target  motion  model  assumed  in  Chapter  III  is  closely  related  to 
models  developed  by  H.  R.  Richardson  and  T,  L.  Corwin.  However,  the  emphasis 
in  this  report  is  on  using  contact  data  to  revise  the  motion  models  or  scenarios 
in  a Bayesian  fashion.  In  previous  work,  Bayesian  methods  were  employed  to 
use  contact  data  to  predict  target  locations  but  not  to  revise  target  motion  scenarios. 

Observations  on  a single  target  are  assumed  to  be  in  the  form  of  contacts  with 
possibly  varying  degrees  of  localization.  There  are  two  pieces  of  information  which 
we  wish  to  obtain  from  these  contact  data;  First,  what  is  the  best  estimate  of  the 
present  target's  track,  and  second,  what  do  contact  data  on  this  particular  target 
tell  us  about  general  operating  patterns.  The  first  question  is  of  interest  primarily 
in  situations  where  contact  data  are  sparse.  We  answer  these  two  questions  in 
Chapter  III  by  devising  methods  for  computing  the  posterior  distributions  for  the 
present  target's  track  and  the  posterior  distribution  on  scenarios  given  a series  of 
contact  data.  The  main  effort  is  devoted  to  developing  a class  of  prior  distributions 
or  models  for  target  motion  which  is  rich  enough  to  represent  real  situations  but 
which  is  still  computationally  tractable. 

A totally  different  class  of  problems  in  ASW  Information  processing  is  posed 
by  a multi-target  environment.  In  Chapter  IV  the  concept  of  Extended  Memory 
processing  is  introduced  with  the  objective  of  resolving  contact  ambiguities  on 
multiple  targets.  When  trying  to  estimate  target  location  distributions  in  a multiple 
target  environment,  the  amount  of  computer  storage  required  to  retain  the  probability 
distributions  can  quickly  become  excessive  unless  the  target  distributions  are  all 
mutually  independent.  However,  even  when  the  prior  target  distributions  arc  assumed 
to  be  Independent,  they  lose  their  independence  as  soon  as  one  obtains  an  ambiguous 
contact,  i.  e. , one  which  cannot  be  positively  identified  as  being  on  a specific  target. 

In  order  to  resolve  this  problem  the  concept  of  Extended  Memory  processing 
considers  all  reasonable  assignments  of  contacts  to  targets  and  computes  the 
probability  that  each  of  these  assignments  is  correct.  For  numerical  reasons, 
assignments  with  very  low  probability  are  excluded  from  the  list.  The  crucial 
point  is  that  conditioned  on  an  assignment  of  contacts,  the  target  distributions 
are  mutually  independent.  The  composite  target  distributions  are  then  obtained 
as  averages  with  respect  to  the  assignment  probabilities  of  these  Independent 
distributions.  The  result  is  that  the  storage  requirements  are  roughly  linear  in 
the  number  of  targets  rather  than  exponential. 

In  addition,  by  considering  single  contacts  in  the  context  of  an  assignment  of  ajl 
the  contacts  to  targets,  it  often  happens  that  a contact  which  is  ambiguous  at  the 
present  time  will  be  resolved  by  future  contacts.  Thus,  the  Extended  Memory 
processing  has  the  capability  of  deferring  judgment  on  a contact  until  more  Information 
is  obtained. 
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Overall,  Extended  Memory  has  shown  Itself  to  be  an  effective  Bayesian 
statistical  technique  to  support  inforin:,.<^ioa  processing  in  a multi-target  environment. 
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OPTIMAL  SURVEILLANCE  AGAINST  A 
MOVING  TARGET  IN  A FALSE  CONTACT  ENVIRONMENT 


• 

A common  and  important  ASW  problem  is  the  allocation  of  surveillance 
assets  so  Ss  to  obtain,  and  maintain  over  time,  a specified  degree  of  target 
localization.  The  purpose  of  this  chapter  is  to  describe  an  idealized  surveillance 
situation  in  which  effort  is  applied  in  a false  contact  environment  with  the 
objective  of  obtaining  localization  information  on  a moving  target.  The  problem 
is  to  allocate  our  surveillance  assets  so  as  to  obtain  by  some  specified  future 
time  the  best  possible  estimate  for  the  target's  location. 

The  results  contained  in  this  chapter  can  be  viewed  as  an  extension  of  the 
approach  taken  in  Chapter  III  of  reference  [a].  In  reference  [a],  a number  of 
single-stage  look-ahead  surveillance  policies  were  formulated  and  then  compared 
using  Monte  Carlo  simulation  techninues.  In  this  current  work,  we  develop  a 
dynamic  programming  solution  for  the  optimal  multi-stage  surveillance  policy 
and  compare  it  with  a number  of  other  surveillance  policies  using  analytic 
techniques.  Asa  consequence  of  this  we  have  been  able  to  confirm  a number 
of  conjectures  which  were  made  in  reference  [a]  on  the  basis  of  Monte  Carlo 
studies.  Additionally,  our  analytic  techniques  have  yielded  new  insight  into 
the  moving  target  problem  and  have  resulted  in  some  new  conjectures. 

In  the  first  three  sections  of  this  chapter  we  describe  the  general  nature 
of  the  surveillance  problem  which  we  consider.  The  components  of  the  sur- 
veillance problem  are  defined  in  terms  of  the  sensor  response  capability  and 
certain  assumptions  concerning  target  motion.  The  Bayesian  updating  of 
target  location  probability  distributions  to  process  contact  data  according 
to  our  sensor  and  target  motion  assumptions  is  discussed  in  the  fourth  section. 

The  fifth  section  is  directed  toward  evaluaUng  the  effectiveness  of  a given 
surveillance  plan  against  a specified  target.  This  section  indicates  a number 
of  different  surveillance  measures  of  effectiveness,  and  develops  the  fundamental 
recursion  relationships  which  will  be  heavily  exploited  in  subsequent  sections. 

The  evaluation  of  optimal  surveillance  plans  is  described  in  the  sixth  and 
seventh  sections.  Such  plans  are  shown  to  be  the  solution  of  a certain  dynamic 
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programming  problem.  Unfortunately,  however,  the  computational  effort  required 
to  determine  the  solution  of  the  dynamic  programming  problem  is  immense.  For 
this  reason  we  consider,  in  the  next  section,  a class  of  suboptimal  surveillance 
plans,  called  stationary  plans,  which  are  in  many  cases  computationally  easier 
to  determine  than  the  optimal  plan. 

In  the  ninth  section  we  compare,  by  way  of  numerical  examples,  the  optimal 
surveillance  plan  to  a number  of  stationary  surveillance  plans.  One  of  these 
stationary  plans,  called  the  1-stage  look-ahead  maximum-information-gain 
plan,  is  shown  to  have  a number  of  very  desirable  properties.  In  particular 
it  appears  to  provide  near  optimal  target  localization  over  a variety  of  measures 
of  surveillance  effectiveness. 

One  striking  feature  of  the  numerical  examples  considered  is  the  rapid 
convergence  of  surveillance  effectiveness  as  the  length  of  time  of  the  surveillance 
operation  increases.  Accordingly,  the  asymptotic  behavior  of  optimal  surveillance 
plans  is  the  subject  of  the  tenth  section.  The  existence  of  a limiting  surveillance 
effectiveness  is  established  imder  a number  of  different  hypotheses.  Unfortunately 
we  have  as  yet  been  unable  to  determine  explicitly  the  value  of  this  limit.  Although 
the  asymptotic  results  contained  in  section  ten  are  of  interest  in  their  own  right, 
the  methods  developed  here  are  a particularly  important  step  in  placing  a 
theory  of  optimal  surveillance  on  a sound  theoretical  basis. 

The  next  three  sections  are  concerned  with  a special  type  of  surveillance 
sensor  called  a homogeneous  sensor.  ITie  importance  of  such  sensors  is  based 
on  the  fact  that  it  is  reasonable  to  model  many  operational  surveillance  sensors 
as  homogeneous  sensors.  Moreover,  our  numerical  examples  indicate  that 
the  optimal  surveillance  plan  for  such  a sensor,  when  employed  against  a 
target  which  satisfies  certain  motion  assumptions,  is  in  fact  a stationary  plan 
of  a particularly  simple  t3q>e. 

This  chapter  concludes  finally  with  a section  where  our  various  results 
are  siunmarized. 

The  Tactical  Situation 

Suppose  we  are  interested  in  performing  a s'mveillance  operation  against 

a stochastically  moving  target  which  is  located  m one  of  N cells,  C C 

The  precise  cell  containing  the  target  is  unknown,  but  at  the  beginmng  of  the 
operation  a probability  distribution  for  the  target's  location  has  been  established. 
VVc  desire  to  allocate  our  surveillance  assets  so  as  to  obtain,  at  the  end  of  tlie 
surveillance  operation,  a posterior  prdbability  distribution  for  the  target's 
position  which  localizes  the  target  as  much  as  possible. 

The  surveillance  operation  itself  is  performed  in  a sequence  of  discrete 
time  stages,  each  of  duration  A,  by  a single  movable  surveillance  sensor. 
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We  are  permitted  only  K stages,  and  we  must  use  the  sensor  so  as  to  obtain 
the  best  possible  estimate  for  the  target's  position  after  these  K stages.  At 
the  start  of  the  surveillance  operation,  the  ^ed  time  at  which  we  must  obtain 
the  best  estimate  of  the  target's  position,  the  horizon,  is  KA  units  in  the  future. 

At  the  beginning  of  each  stage,  one  or  more  cells  are  chosen  and  then 
investigated  for  a total  amount  of  time  at  most  A.  The  choice  of  cell  or  cells, 
in  general,  depends  upon  the  number  of  stages  remaining  in  the  operation, 

(i.  e. , the  amount  of  time  remaining  until  the  horizon),  the  surveillance 
capability  of  our  sensor,  our  target  motion  assumptions,  and  the  current  target 
location  probability  distribution,  TTie  sensor  response  results  are  then  used 
to  update  the  target  location  probability  distribution.  Target  motion  is  assumed 
to  take  place  at  the  end  of  each  stage  and  results  in  a new  target  location  distri- 
bution. The  next  stage  begins  with  this  new  target  location  probability  distribution 
and  is  the  beginning  of  a surveillance  operation  involving  one  fewer  stages.  Hie 
phasing  of  these  various  activities  is  indicated  in  Figure  II-l. 

Observe  that  the  surveillance  operation  described  above  is  dsmamic  in 
that  the  allocation  of  effort  at  each  stage  depends  upon  the  results  of  the  pre- 
vious stage.  This  is  comparable  to,  say,  a VP  operation  where  flights  are 
flown  daily.  The  results  of  each  day's  search  together  with  the  target  motion 
assumptions  are  then  used  to  decide  the  allocation  of  effort  for  the  next  day's 
flights. 


Sensor  Response  Assumptions 

Because  of  the  possibility  of  false  contacts,  a sensor  response  in  a given 
cell  does  not  necessarily  imply  that  the  target  is  in  that  cell.  Similarly  the 
lack  of  a sensor  response  in  a given  cell  does  not  necessarily  mean  that  the 
target  is  not  in  the  cell  being  investigated.  With  this  in  mind  we  define 
rij,  i,  j - 1,  2, , . . , N to  be  the  conditional  probability,  given  that  the  target 
is  in  cell  Ci,  that  investigating  cell  Cj  for  the  length  of  time  A will  result 
in  a sensor  response.  Moreover  we  will  assume  that  if  the  target  is  in  cell 
C^,  then  investigating  cell  Cj  for  the  length  of  time  t,  0 < t < a,  will  result 
in  a sensor  response  with  probability  rjjt/A. 

The  N X N matrix 
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PHASING  OF  VARIOUS  ACTIVITIES  IN  A SURVEILLANCE  OPERATION 


surveillance  surveillance  surveillance  surveillance  surveillance  surveillance 


is  called  the  sensor  response  matrix;  in  our  model  it  completely  characterizes 
the  surveillance  capability  of  the  sensor.  Note  that,  if  there  is  no  possibility  of 
false  contacts,  then  R is  a diagonal  matrix. 

In  order  to  simplify  our  calculations  we  will  suppose  that  there  can  be  at 
most  one  sensor  response  in  any  single  surveillance  stage.  Additionally  we 
will  suppose  that  sensor  responses  from  different  time  periods  are  statistically 
independent. 


Target  Motion  Assumptions 

Target  motion  is  assumed  to  be  a Markov  process  which  takes  place  at 
the  end  of  each  surveillance  stage.  Let  d^j,  i,  j = 1,  2,  ....  N,  be  the  conditional 
probability  that  the  target  will  move  to  cell  i given  that  it  originated  in  cell  j. 

The  following  N x N matrix,  called  the  transition  matrix. 
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completely  describes  the  target  motion  assumptions.  Observe  that  d. . = 1 
and  that  in  the  case  of  a stationary  target,  the  transition  matrix  is  simply  the 
identity  matrix,  M = I. 


If*  X = (x  , Xg, . . . , x^)'  is  the  target  location  distribution  before  target 
motion,  then  Y = ifcc  is  the  target  location  distribution  after  a single  stage  of 
target  motion.  Moreover,  assuming  that  no  surveillance  is  performed  between 
motion  steps,  the  target  location  distribution  after  k-motion  steps  is  Y = M*^X. 


Mathematical  Structure  of  K-Stage  Surveillance  Operations 

In  this  section  we  formulate  the  mathematical  structure  of  a K-stage 
surveillance  operation  performed  on  N cells,  C^,  C2, . . . , Cj^.  We  will 
assume  that  the  stochastic  structure  of  the  surveillance  operation  is  completely 


We  use  the  notation  X'  to  denote  the  transpose  of  a vector  X. 


-9- 


known,  that  is,  that  the  target  motion  matrix  M and  the  sensor  response  matrix 
R are  known. 

It  is  useful  to  introduce  the  positive  orthant  in  N-space 

^2 ^ ^ ' 

For  each  X = (x^, . . . , Xn)'«  Vj,.  we  define  ||  x|L  = Xj +. . . +Xj^.  Observe 
that  if  xjj  is  the  probability  that  the  target  is  in  cell  Cj^,  k - 1,  2, , . . , N, 
than  the  vector  X ==  (xj, ....  Xj^)'  is  an  element  of  the  N-1  dimensional  simplex 

■^N-l  ' " ''n  ' 

We  will  refer  to  ^ as  the  probability  state  space.  Every  target  location 
probability  distribution  on  N cells  Cj,  C2, ....  Cj^  can  be  represented  as  a 
point  of  the  probability  state  space. 

We  now  define  a K- stage  surveillance  plan  on  the  cells  C^,  C^, . . . , to 
be  a function  (p  x (l,  2, . . . , K}  - Vj^  such  that  1 1 (X,  k)  | < 1 for 

(X,  k)  € X {1,  2 K}.  In  order  to  explain  this  definition  suppose  that 

at  stage  k of  the  surveillance  operation,  1 < k < K,  the  current  target  location 
probability  distribution  is  X e k)  = (X,  k), . . . , <pj^  (X,  k)). 

The  surveillance  plan  (p  then  requires  that  we  allocate  to  cell  C the  amount 
of  surveillance  effort  (p^A,  I ^ 1,  2, . . . , N.  The  condition  1 1 ippi,  k)  | L < 1 
is  the  constraint  that  the  total  amount  of  surveillance  effort  available  at 
state  k is  A. 

We  suppose  that  the  response  capability  of  our  surveillance  sensor  is 
defined  by  the  N x IS  response  matrix  R - [rij].  Target  motion  is  assumed  to 
be  Markovian  and  is  defined  by  the  transition  matrix  M [djj].  The  diagonal 
matrices  Tj  diag(rij,  J'2j, . . . , r^j),  j - 1,  2,  ...,N,  will  prove  particularly 
useful  in  the  following  development.  Observe  that  Tj  completely  characterizes 
the  sensor  capability  of  investigating  cell  Cj. 

At  the  beginning  of  stage  k of  our  surveillance  operation  let  the  (prior) 
target  location  probability  distribution  be  X c We  will  now  determine 

the  various  (posterior)  target  location  probability  distributions  possible  at 
the  end  of  stage  k.  and  indicate  the  probabilities  with  which  they  will  occur. 
Suppose  that  ip(K,  k)  (ip^  (X,  k), . . . , v?j^(X,  k))  so  that  we  are  to  allocate  to 
cell  the  amount  of  surveillance  effort  (X,k)A.  It  follows  then  that 
the  probability  of  obtaining  a sensor  response  from  cell  / is  (p,  (X,  k)  [[  T^  X [( j. 
Moreover  it  follows  from  Bayes'  theorem  that  the  posterior  target  location 
probability  distribution  given  a sensor  response  in  cell  is  T X/||  T X | j j. 
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Since  we  are  assuming  at  most  one  sensor  response  in  each  surveillance 
stage,  it  follows  that  the  prdbability  of  obtaining  no  response  is 
1 - 2 (X,  k)  1 1 X 1 1 Additionally  the  posterior  target  location 

probability  distribution  given  no  sensor  response  is 


N 

a - s 
1=1 


(X,k)T  )X/(1- 


N 

2 

1=1 


(X,k)T  X 


In  order  to  obtain  the  (posterior)  target  location  probability  distributions 
at  the  end  of  stage  k it  is  now  only  necessary  to  apply  the  target  motion  matrix  M 
to  each  of  the  above  distributions.  We  thus  define 


(X.k) 


N 

a - 2 <^  (X,k)T)X 
i=l  •’ 

N 

1-  2 |[<p  (X.k)TX|| 
j=l  i i ^ 


I = 0. 


I = 1,  2 N. 


(U-1) 


I 

■J 


Note  that  Uq(X,  k)  is  the  posterior  target  location  distribution  at  the  end  of 
stage  k given  that  the  prior  distribution  at  the  beginning  of  stage  k is  X and 
that  there  were  no  sensor  responses  during  stage  k.  Similarly  U (X.k), 

I = 1,  2, . . . , N,  is  the  posterior  target  location  probability  distribution  at 
the  end  of  stage  k given  that  the  prior  distribution  at  the  beginning  of  stage  k 
is  X and  that  there  was  a sensor  response  from  cell  , It  is  interesting 
to  observe  for  / = 1,  2, , , , , N that  U (X,  k)  does  not  depend  on  the  amount 
of  surveillance  effort  applied  to  cell  and  so  in  particular  is  independent  of 
the  surveillance  stage  k.  This  results  from  the  linearity  of  our  sensor  response 
assumptions. 

So  as  to  simplify  our  notation  further,  define 
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(x,k) 


1-  Z ||,/,(X.!r>TXl|,  I = 0 
i=i  ' J 


(X.k)T^X||, 


(n-2) 


l = 1,  2 N, 


The  quantity  6^0^,  k)  is  the  probability  of  obtaining  no  sensor  response  at  stage  k 
given  that  the  prior  target  location  distribution  at  the  beginning  of  stage  k is  X. 
Similarly,  for  I = 1,  2, ....  N,  0 (X,  k)  is  the  probability  of  obtaining  a sensor 
respcaise  from  cell  at  stage  k given  tihat  the  prior  target  location  distribution 
at  the  beginning  of  stage  k is  X, 

Observe  next  that  if  X is  the  prior  target  location  probability  distribution 
at  the  beginning  of  stage  k,  then  MX  is  the  expected  posterior  target  location 
probability  distribution  at  the  end  of  stage  k.  Indeed  we  have 


MX  = z 0,  (X,k)  U,  (X.k). 


Consider  now  a surveillance  operation  lasting  K stages.  Let  Xq  be  the 

initial  target  location  probability  distribution  and  let  X^  , I =1,  2 K, 

be  the  target  location  probability  distribution  at  the  end  of  stage  I . Each 
X^  is  a random  vector  and  the  mapping  X^  defines  a discrete  time  Markov 

process  on  the  probability  state  space  each  X e 

and  each  Borel  measurable  set  A C we  define  for  k = 1,  2, ....  K, 


Pk(X,  A)  ^Z^  0^  pt,k)/Xu^  (X,k)^^^’ 


where  /iy(*)  is  the  unit  point  mass  concentrated  at  Y,  then  pj^(.,  •)  is  the  stochastic 
transition  function  for  stage  k of  the  Markov  process. 

A major  question  in  the  theory  of  surveillance  is  the  distribution  of  the 
random  variable  X^  , f 1,  2, ....  K.  Indeed,  the  distribution  of  the  X^ 
indicates  the  relative  likelihood  of  the  various  target  location  probabilify 
distributions  that  can  arise  from  a given  surveillance  plan.  Define  inductively 


p''(Xo.A)  P,(Xq.A) 


P^'  ^(Xq.  A)  / p<^  A)p^  (X^,  dY),  I 2.3,.,..  K. 

*^'-1 


It  follows  that  P^Q'  • ) is  the  probability  distribution  for  the  target  location 
probability  distribution  at  the  end  of  stage  I , given  that  the  initial  target 
location  probability  distribution  was  Xq.  It  is  an  immediate  consequence  of 
equation  (11-3)  that 

Yp<^>(X^,dY), 

that  is  the  mean  target  location  probability  distribution  at  stage  I , given  that 
the  initial  distribution  was  X^,  is  Xq. 


Evaluating  Surveillance  Plans 

In  order  to  determine  the  effectiveness  of  a surveillance  operation,  it  is 
first  necessary  to  establish,  for  each  target  location  probability  distribution, 
a measure  of  the  localization  information  implied  by  that  distribution.  There 
are  a tremendous  number  of  possibilities  for  such  a measure  of  effectiveness. 
Suppose  for  example  that  X - (xj^, , . . , j is  the  current  target's  location 

probability  distribution.  Our  best  single-cell  estimate  for  the  target  location 
is  that  cell  which  contains  the  target  with  highest  probability.  Indeed  if 
xii  > xi2  > . . . > xi^,  then  the  target  is  most  likely  to  be  located  in  cell  Cjj, 
and  the  probability  that  this  estimate  is  correct  is  If  this  is  to  be  our 
measure  of  effectiveness,  it  is  clear  then  that  our  surveillance  effort  should 
be  applied  so  as  to  maximize,  at  the  end  of  the  surveillance  operation,  the 
probability  that  the  target  is  in  the  highest  probability  cell.  Other  closely 
related  measures  of  effectiveness  would  be,  for  n 1,  2, , , . , N,  the  probability 
that  the  target  is  located  in  the  first  n highest  probability  cells  Xi ^ +. . . + Xj  . 

We  are  thus  led  to  define  the  functions  fj, . . . , f on  the  probability  state 
space  by  f„  (X)  Xj^  +. . . f Xi^. 

As  another  example  of  a possible  measure  of  surveillance  effectiveness, 
suppose  that,  at  the  end  of  a surveillance  operation,  we  will  search  for  the 
target  with  a specified  sensor.  The  goal  of  this  search  is  to  obtain  a target 
detection,  and  we  can  measure  the  amount  of  localization  information  implied 
by  a target  location  probability  distribution  in  terms  of  parameters  associated 
with  this  search.  For  example  we  can  use  the  expected  time  to  detect  the 
target  with  the  search  sensor  as  one  measure  of  the  localization  provided  by 
a probability  distribution.  If  the  search  has  a limit  on  the  total  available 
effort,  then  we  can  use  as  our  measure  of  effectiveness  the  probability  of 
obtaining  a detection  within  the  specified  time  limit.  Observe  that,  in  the 
case  of  a perfect  sensor*  which  is  only  permitted  to  search  one  cell,  the 


A perfect  sensor  detects  the  target  if  and  only  If  it  searches  in  the  cell  w’hlch 


f contains  the  target, 

I 

I 
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maximum  probability  of  obtaining  a detection  is  the  probability  that  the  target 
is  located  in  the  highest  probability  cell. 

As  a final  example,  we  can  suppose  that  the  goal  of  our  surveillance  operation 
is  to  obtain  a posterior  target  location  probability  distribution  which  contains 
the  maximum  information  possible.  Recall  that  if  x^  is  the  probability  that 
the  target  is  located  in  cell  , I = 1.  2, , , . , N,  the  information  content  of 
the  probability  distribution  X = (xj, , . . , Xj^)'  is 


^(X)  = InN  + 


Although  we  will  be  chiefly  concerned  with  the  measures  of  effectiveness 

given  by  the  functions  fj,  f2 fjg  defined  above,  it  is  generally  useful 

to  have  a larger  class  of  effectiveness  measures.  For  this  reason  we  define 
the  space  of  objective  functions  C to  be  the  space  of  all  convex  functions  f 
defined  on  which  satisfy  the  uniform  Lipschitz  condition:  there  is  a 

constant  L > 0 (which  depends  on  f)  such  that  if  Xj,  X2  e t^N-1 


f(X^)  - f(X2)  I < L II  Xj  -X^ 


It  is  interesting  to  note  that  the  convexity  of  f e C implies  that  f satisfies  the 
above  uniform  Lipschitz  condition  on  each  compact  subset  of  See  for 

example  reference  [ e ],  Thus  our  Lipschitz  condition  is  only  a restriction  near 
the  boundary  of 

Because  of  the  convexity  assumption  observe  that  for  X j,  X2, . . . , X^  e «^-l 
each  objective  function  f t C satisfies 


rii 

f(  2 0,  X, ) < Z 0,  f(X, ) whenever  (?,  > 0,  0,  +. . . +0  = 1. 

k^l  k k - k 1 k K k—  1 m 


In  particular  f attains  its  maximum  at  a vertex  of  *^-1’  so  that  the  greatest 
return  provided  by  the  objective  function  is  when  the  target  is  localized  to  a single  cell. 

Suppose  now  that  we  wish  to  evaluate  the  K-stage  surveillance  plan  0 using 
the  measure  of  effectiveness  given  by  the  objective  function  f t C.  For  each 
X i.  concern  is  with  the  function 
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EV(</>,f)  (X)  = 


expected  /^.lue  of  f at  the  end  of  the 
surveillance  operation  cp  given  that 
the  initial  tar^t  location  distribution 
isXe 


The  function  EV  indicates  the  amount  of  target  localization,  in  terms  of  the 
objective  f,  that  we  can  expect  at  the  end  of  the  surveillance  operation  ip. 

The  most  convenient  method  of  computing  the  function  EV  is  to  introduce, 
for  each  k = 0,  1,  2, , . . , K and  each  X e the  functions 


I expected  value  of  f resulting  from  following 
the  last  k stages  of  plan  cp  given  that  there 
are  k stages  remaining  and  that  the  target 
location  probability  distribution  at  the 
current  stage  (stage  K-k)  is  X. 


The  function  A {0,  f)  indicates  the  expected  target  localization  that  can  be 
achieved  relative  to  the  objective  f given  that  there  are  k stages  remaining 
in  the  surveillance  operation.  In  particular  = EV((p,  f). 

We  will  compute  the  functions  f),  k = 0,  1,  2, . , , , K,  inductively. 
It  is  evident  that  f)  f.  Suppose  that  A^(^,  f)  is  known  and  that  we  are 

to  compute  f).  Let  0 = (<)7j^(X,  k), . . . , <^j,j(X.  k)).  If  the  prior  at  the 

beginning  of  stage  K-k  is  X,  we  are  to  allocate  to  cell  Cj  the  amount  of 
surveillance  effort  (X,  K-k),  In  the  notation  of  equations  (II- 1)  and  (II -2), 
we  will  obtain  a sensor  response  from  investigating  cell  C,  with  probability 
0 ^ (X,  K-k)  and  in  this  case  the  posterior  probability  distribution  is 

(X,  K-k).  Similarly,  the  probability  of  obtaining  no  sensor  response  is 
OQ^i,  K-k)  and  the  resulting  posterior  target  location  distribution  is 
Uq(X,  K-k).  It  follows  then  from  the  definition  of  A^'''l((^,  f)  that 


k+1  ^ k 

A (0,  f)  (K)  ^ (X,  K-k)  f)  (U^  (X,  K-k)). 


(11-6) 


It  is  possible  to  simplify  equation  (II-6)  somewhat  by  extending  A {0,f)  (•) 
to  all  of  by  defining 


I 


itY£  v^,  Y^O 
if  Y ^ 0. 

The  function  A (cp,  f)  (•)  defined  in  this  manner  is  homogeneous  in  the  sense  that 

A‘^(cp.f)  (XY)  - XA^(<^.f)  (Y).  all  X >0,  Ye 

Using  now  the  relation  (U-7)  and  the  definitions  (II-l)  and  (II-2),  we  can 
write  equation  (II-6)  in  the  form 

k+1  k N 

A (<p.  f)  (X)  ^ A (cp,  f)  ( cp^  (X,  K-k)  M a-T^  ) X) 

N k 

+ 2 </),  (X,  K-k)  A^((/>,  f)  (MT  X). 

1 = 1 '■  ^ 


(U-7) 

i 


(11-8) 


The  recursion  relationships  (11-6)  and  (II-8)  are  fundamental  in  that  they  enable 
us  to  compute  analjrtically  the  effectiveness  of  any  given  surveillance  system. 
We  will  heavily  exploit  these  relationships  in  the  subsequent  sections. 


Optimal  Surveillance  Plans 

The  purpose  of  this  section  is  to  provide  a method  for  determining  the 
optimal  K-stage  surveillance  plan  for  any  objective  function  f r C,  A K-stage 
optimal  surveillance  plan  cpl^  is  characterized  by  the  condition 


EV(<p^,f)  >EV(^,r) 


i 


for  any  surveillance  plan  cp.  This  is  to  say  that  at  the  end  of  the  surveillance 
operation  (i.e. , at  the  end  of  stage  K)  the  optimal  surveillance  plan  yields 
the  greatest  possible  expected  target  localization  relative  to  the  measure  of 
effectiveness  f. 


In  order  to  determine  the  K-stage -optimal  surveillance  plan  c'>^  consider 
the  functions  A^  , / 0,  1,  2, , , , , K defined  by  equation  (II-5).  The  iterative 

scheme  (11-8)  and  the  fact  that  A^((^*^,  f)  EV((^K  f)  force  immediately  the 
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inequalities 


Z = 0,  1,  2....,K, 


for  all  surveillance  plans  cp.  With  this  we  can  now  inductively  compute  the 
functions  A^  (^,  f),  1 = 0,  1,  2, . . . , K.  Recall  first  that  A®((/?K,  f)  . f.  Now 
suppose  that  A^((^K,  f)  has  been  determined.  It  follows  immediately  from 
equations  (II-8)  and  (II-9)  that 

= max{FV^,Q!2....,aj^.X)  : a > 0,  a ^ + a < 1} . 


where 


_k, 
F {a 


• • • • 


Q-^X)  - A‘^(<;j^f)(  r M(I-a!  T )X)  + ^ o A*'(cP^.f)(MT  X). 
IN  1=1  I I 1=1  ''  ^ 


. Z TC 

It  is  important  to  observe  that  the  functions  A (</>^f)(X).  I = 0,  1,  2,...,N, 
are  convex  functions  of  X e «5n-1‘ 

is  by  assumption  a convex  function  of  X.  We  now  argue  by  induction:  suppose 
that  A^(<^^,  f)  is  a convex  function  on  •9n_i.  Then  A*^(</>^,  f)  is  a convex  function 
on  Vn  and  so,  by  equaUon  (II-ll),  Fk(a  . . .,a^,X)  is  a convex  function  of 
X e ,^_1.  It  follows  then  from  (II-IO)  that  A^+T((;p^,  f)pC)  is  a convex  function 
of  X € ^N-l*  desired  result. 

Suppose  now  that,  for  a given  X e the  numbers  a j*, . . . ,a^  satisfy 


fV. 


a . 


. + a 


N 


^ < 1. 


It  is  evident  then  that  a K -stage  optimal  surveillance  plan  (p  can  be  achieved 
by  allocating,  at  stage  K-k,  the  fraction  of  the  available  effort  to  cell  , 
i,  e. . (p^(X,  K-k)  = (a  j*, , . , , a j^*).  We  claim  that  the  optimal  allocation  of 
surveillance  effort  can  always  be  achieved  by  allocating  the  total  available 
effort  to  a single  cell.  As  we  shall  see,  this  is  a consequence  of  the  linearity 
of  the  sensor  detection  process. 
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kl 


(II-9) 


(R-IO) 


(H-ll) 


J 


Writing  now 


N 


N 


M(I  - Z a,  T,  ) X = (l-^)MX+/3  (MX  - Z T X), 
1-1  I I 1 = 1 P ^ 


Ic  TC 

it  follows  from  the  convexity  of  A , f)  (•)  that 


fV  Q!jJ,X)  <(1  - /3)  A^(<p^f)  (MX) 


k O',  a a^. 

+ ^ ir  {—L,-2l X), 

.3  ^ /? 


But 


f)(MX)  < A*^  (<p^,  f)(M(I  - ) X) 


vk  /,„K 


a . 

1 


and  so  we  conclude  that 


N Q! . k K 

+ Z -i  A^  (AO  (T,  ,X) 
1 = 1 0 


I 


a . 


a 


N 


fV*., 


,k  .“l 


N 


..Oj^.X)  <F  X) 


with  equality  if  and  only  if 


k+ 1 k K 

/3  = 1 or  A (<^*^.  f)  (X)  = A (cp  . f)  (MX). 

In  order  to  interpret  the  second  condition  in  (11-12),  observe  that  if  X e ^ 
is  the  target  location  probability  distribution  with  k+1  stages  remaining  in 
a surveillance  operation,  then  MX  is  the  probability  distribution  with  k stages 
remaining  assuming  that  no  surveillance  was  performed.  The  second  condition 
in  (11-12)  says  then  that  the  increment  in  the  expected  target  localization 
produced  by  the  optimal  surveillance  plan  with  k+1  stages  remaining  is 
the  same  as  if  no  surveillance  is  performed  at  that  stage.  We  conclude  then 
that  the  optimal  surveillance  plan  requires  that  we  use  the  total  amount  of 
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(Il-12\ 


surveillance  effort  available  at  a given  stage  0=1),  or  else  the  available 
surveillance  effort  at  the  stage  in  question  is  insufficient  to  affect  the  result. 

Without  loss  of  generality  we  can  now  assume  that  in  equation  (II-IO) 
o ^ r . . .+  Qiisj  ^ 1,  Subject  to  this  condition  we  then  have 


(Af)  ( 2 Ma-T^)X) 
/ 1 

N k K 

+ 2 O',  A ((p  (MT  X). 
f=l  ^ ^ 


k X k 

But  since  A ((^  , f)pc)  is  a convex  function  of  X,  it  follows  that  F (a  ^ Q?  X) 

is  a convex  function  in  the  variables  o?  . . . .o'jq.  and  so  it  achieves  its  maximum 

at  one  of  the  vertices  of  the  convex  simplex  {(a^ 0!j^)  : Oj  > 0,  o;  = 1}  , 

Thus  as  claimed  the  optimal  surveillance  plan  can  always  be  achieved  by  allocating 

the  entire  available  effort  to  a sii^le  cell. 


Finally  observe  that  the  K -stage -optimal  surveillance  plan  depends  only 
on  the  number  of  stages  remaining  in  the  surveillance  plan.  Thus  if  Kj  < K2 
we  have 


A 


k 


k=  0.  1,  2 Kj. 


Evaluating  Optimal  Surveillance  Plans  Numerically 

In  the  previous  section  we  described  a method  for  determining  a K-stage- 
optimal  surveillance  plan  for  any  objective  function  f e C and  any  sensor 
response  and  target  motion  matrices  R and  M,  respectively.  Unfortunately, 
the  method  requires  the  solution  of  a dynamic  programming  problem,  and  like 
such  solutions  in  general,  it  requires  an  enormous  amount  of  computational 
effort  to  implement.  In  particular,  as  the  number  of  cells  increases,  the 
dimension  of  the  probability  state  space  increases. 


For  example,  consider  a surveillance  problem  on  N cells,  C, , Co, . . . , C... 

1 1 ^ ^ iN 

Recall  that,  in  order  to  determine  A f),  we  must  know  the  values  of 

f) 

on  In  order  to  have  the  values  of  A^((^K,  f)  available  on  a 

computer,  we  must  quantize  the  state  space  ^n-1  ^"1°  ^ finite  number  of 
points  at  which  the  values  of  A^(<^X  f)  are  stored.  The  values  of  A^((^?K,  f) 
at  other  points  are  then  estimated  by  an  interpolation  scheme. 


Suppose  that  we  decide  to  partition  each  coordinate  axis  of  into  J 
equally  spaced  points.  Our  probability  state  space  is  then  quantized  into  the 
points  (ij,  i2, , . . , where  i^, , . , , are  nonnegative  Integers  such  that 

ij  +. . . + = J.  Observe  that  1/J  is  the  coordinate  resolution.  It  is  easy  to 
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see  that  the  number  of  points  in  such  a quantization  is 


fT')- 

Ideally  a coordinate  resolution  of  . 01  or  better  is  highly  desirable.  Observe, 
however,  from  Table  II-l  that  such  a resolution  is  practical  for  at  most  three 
cells.  To  solve  a four -cell  optimal  surveillance  problem  clearly  requires  a 
resolution  not  much  finer  than  . 05.  Unfortunately,  such  a resolution  is  barely 
adequate  to  guarantee  a good  approximation  to  the  functions  f),  since 

it  provides  only  1.  5 significant  decimal  places.  Considerable  programming  care 
must  thus  be  exercised  to  insure  valid  results,  even  for  as  few  as  four  cells. 

There  are  a number  of  special  features  of  the  functions  Ak((^K,  f)  which 
can  be  exploited  so  as  to  ease  their  evaluation.  First  note  that  these  functions 
are  convex,  so  that  a piecewise  linear  interpolation  with  vertices  at  the  points 
(il, . . . , if,j)J“^  will  provide  an  upper  bound.  We  now  would  like  to  estimate  the 
accuracy  of  this  piecewise  linear  approximation. 

Recall  that  the  space  of  objective  functions  C is  the  space  of  all  convex 
functions  on  which  are  uniformly  Lipschitz  continuous  in  the  sense  that 

if  f e C there  exists  L > 0 such  that 


f(X^)-f(X2)|  < L 


^1-^21 


'^-1' 


Each  function  f t C can  be  extended  to  a function  defined  in  the  positive 
orthant  in  N- space 


(yj,...,yj^)  : Yj  > 0,  j 


by  setting 


Number  of  Cells 


Suppose  now  that 


|f(X^)-f(X2)|<  alix^,  X^  e 

If  Yj,  e Vj^,  Yj,  Y^  ^ 0,  it  is  easy  to  see  that 

|f(Yp-f(Y2)|<L(2+max{f(X)|X€^_^})  ||  Y^-Y^H^ 


and  SO  f is  also  equicontinuous  on  The  smallest  constant  L such  that 

N 


I f(Y^)-f(Y2)  I < L II  Y^-Y^  II  . all  Y^.  Y^  e 

is  called  the  modulus  of  continuity  of  f (on  Vjq).  The  modulus  of  continuity 
of  f provides  a usable  estimate  for  the  fineness  of  a quantization 
required  to  achieve  a piecewise  linear  approximation  for  f with  a specified 
accuracy.  'Ihe  following  theorem  relates  the  modulus  of  continuity  of  the 
functions  0 to  that  of  f.  and  so  provides  estimates  for  the  accuracy 

of  approximations  for  f), 

'fheorem  II-l.  Let  be  the  K-stage  optimal  surveillance  plan  for  the 
objective  function  f < C.  If  the  modulus  of  continuity  of  f on  Vj^  is  Lq,  then 

f)  is  Lipschitz  continuous  with  modulus  of  continuity  Lj^,  k 1,  2, . . . , K. 
Additionally 


V 


Proof.  Suppose  that  the  sensor  response  matrix  is  R and  the  motion 
matrix  is  M.  The  proof  proceeds  by  induction  on  k.  First  note  that  the  result 
is  trivally  valid  for  k 0.  We  assume  now  that  the  result  is  valid  for  0,  1,  2, . . . . k; 
we  must  verify  it  for  k+1. 


Now  since  the  optimal  surveillance  plan  can  always  be  achieved  by  allocating 
the  entire  available  effort  to  a single  cell,  it  follows  from  equation  (11-11)  that 


Now  let  X^,  and  suppose  that 

,k+l,  K ^ .k+1,  K , 

A {cp  ,f)(X^)>A  ((p  ,f)(X2). 

If 

A^^\/,£)(Xp  = A^(/.f)(MT^X^)  + AN/,f)(M(I-T,  )X^) 
it  follows  that 

I A^^\/,f)(X^)  - A*'^\/.f)(X2)| 

< I A^^\/,f)(X^)  - A^(<p^,{){MTX^)  - A^,/j^f)(M(I-TpX2)| 

= ( A^{<p^,t)iMT^X^)  + A^cp^,f)(M(I-T^)X^) 

- A^(/,f)(MT^X2)  - A^(co^f)(M(l-T^)X2)  | . 

Using  now  our  induction  hypothesis,  we  obtain 
I fXXjl-A*^'"^/,  fXX^)  I < L {||  MT  (Xj-X^)  II  ^ 

+ ||m(i-t  )(X^-X2)||j}. 

But  1 1 MU  1 1 j < 1 1 U 1 1 for  any  vector  Ue  IR^,  so  that 

I A^^^{(p^  \ f)pc^)  - A^^  OfX^)  I < L { j I ^ ^ 1 

^ ll(I-T)  (X^-Xg)!!^} 
-LIIX^-X^II^ 
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and  the  desired  result  is  verified. 


I 

r 

[ 


( 

[ 

I 


i 


It  is  easy  to  construct  examples  in  which  L = , ==  Indeed, 

consider  a two  cell  surveillance  problem  against  a stationary  target  involving 
a uninformative  sensor  response  matrix 


R = 


c 


c 


0 < c < 1. 


Then  for  any  K- stage  surveillance  plan  (p  (including  the  K-stage  optimal  plan) 
A^(<^,f)  - f,  k=0,  1,  2,...,K. 


Stationary  Surveillance  Plans 

In  most  cases  the  K-stage  optimal  surveillance  plan  depends  strongly  on 
the  horizon.  Thus  it  may  happen  that  the  surveillance  policy  designed  to  obtain 
the  best  possible  estimate  at  one  specified  future  time  is  suboptimal  for  a 
different  horizon. 

Suppose  however  that  wo  are  interested  in  the  maintenance  of  target 
localization  over  time  relative  to  some  objective  function  fe  C , In  particular 
such  a surveillance  plan  does  not  end  at  any  specified  horizon,  but  rather 
continues  indefinitely  over  time. 

Since  the  K-stage  optimal  surveillance  plans  q?  depend  on  the  number  of 
stages  until  the  horizon  and  not  on  the  number  of  stages  since  the  beginning 
of  the  operation,  such  plans  are  generally  not  extendable  as  optimal  plans  to 
horizons  different  than  the  initially  specified  horizon.  It  is  thus  desirable 
to  find  physically  realizable  surveillance  plans  which  are  nearly  optimal  for 
any  horizon. 

The  simplest  surveillance  plans  which  are  extendable  to  any  possible  horizon 
are  the  stationary  surveillance  plans.  A K-stage  surveillance  plan  (p  defined 

on  N cells  C^,  C2 is  stationary  if  it  depends  only  on  the  current  target 

location  probability  distribution  X and  not  on  the  current  stage  of  the 

operation.  Such  a plan  is  a function  X (1,  2 K}  “*•  such  that 

iJ  <pII  < 1 which  is  of  the  form  <p(X,  1)  - </j(X,2)  = . . . = (^(X,  K). 

Such  a plan  is  obviously  extendable,  as  a stationary  surveillance  plan, 
to  any  number  of  stages  K, 

Suppose  that  (p  is  a stationary  surveillance  plan  and  that  f f C is  our  localization 


L 


-24- 


objective  function.  If  X is  the  prior  target  location  distribution,  the  expected 
degree  of  target  localization  obtained  by  after  k stages  is  A^((;£>,f)(X),  Thus 
the  expression 


K+1  k=0 


AN<p.f)(X) 


is  the  average  expected  target  localization  during  the  first  K-stages.  If  K is 
sufficiently  large,  this  expression  represents  the  long  term  expected  target 
localization  one  can  expect  to  maintain  with  q). 

For  each  k let  q)^  be  the  K-stage  optimal  surveillance  plan  relative  to 
feC.  Observe  that 


.k.  - .k,  K _ 

A (</J,  f)  < A (q>  ,f). 


Thus  although  the  K-stage  optimal  surveillance  plans  q>  may  not  be  physically 
realizable  for  the  long-term  maintenance  of  target  localization,  they  do  provide 
an  upper  bound  for  the  expected  localization  that  stationary  plans  can  achieve. 
This  fact  is  useful  in  determining  how  close  to  optimal  a particular  stationary 
surveillance  plan  localizes  a given  target. 


Examples  Comparing  Optimal  and  Stationary  Surveillance  Plans 

In  this  section  we  compare  numerically  the  localization  achieved  by  the 
optimal  surveillance  plan  to  that  which  can  be  obtained  by  several  stationary 
plans.  The  particular  surveillance  system  considered  is  operating  against  a 
target  located  in  one  of  four  cells. 

We  assume  that  our  surveillance  system  has  the  sensor  response  matrix 


".15 

.01 

.01 

.09  - 

^ — z. — target  in  cell  C 

.01 

.15 

.01 

. 09 

.01 

.01 

.15 

.09 

-.01 

.01 

.01 

.1  _ 

investigate  in  cell  Cg 


Thus  if  we  investigate  cell  C2  and  the  target  is  located  in  cell  C^,  the  pro- 
bability of  obtaining  a sensor  response  is  . 01.  Note  that  the  sensor  capability 
achieved  by  investigating  any  of  the  first  three  cells  is  the  same.  Additionally, 
if  the  prior  target  location  distribution  is  uniform,  X = (,  25,  .25,  . 25,  . 25), 
a contact  obtained  from  investigating  any  of  the  first  three  cells  will  localize 
the  target  to  that  specific  cell  with  probability  . 83, 

Investigating  cell  with  this  sensor  is  not  as  informative  about  the  target's 
location  as  is  investigation  of  any  of  the  other  cells.  To  see  this,  observe  that 
if  the  prior  target  iccadon  distribution  is  uniform,  a contact  obtained  from 
investigating  cell  C4  will  localize  the  target  to  that  cell  with  probability  only 
. 27.  This  is  in  comparison  with  the  corresponding  value  . 87  obtained  by 
investigating  any  of  the  first  three  cells. 

We  also  assume  that  the  target  motion  matrix  is  given  by 
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.05 
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.05 

.05 
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Thus  if  the  target  is  located  in  one  of  the  first  three  cells  it  will  remain  there 
with  probability  . 85  and  move  to  any  one  of  the  other  cells  with  probability  . 05. 
Similarly  if  the  target  is  located  in  cell  C4  it  will  remain  there  with  probability 
, 91  and  move  to  any  one  of  the  other  cells  with  probability  ,03.  It  can  be  shown 
that,  for  each  X e ^3 


Urn  = (.214,  .214,  .214,  .358).  j 

k-»oo  j 

Thus,  if  no  surveillance  effort  is  applied,  the  target  location  distribution 
will  asymptotically  approach  (,  214,  .214,  .214,  .358).  In  particular,  on 
the  basis  of  our  target  motion  assumptions,  the  target,  after  a sufficiently 
long  period  of  time,  is  most  likely  to  be  located  in  cell  C4,  and  this  estimate 
for  the  target's  position  will  be  correct  with  probability  . 358. 

Ihe  effect  of  surveillance  is  to  increase  our  knowledge  about  the  target's 
position  beyond  that  which  can  be  achieved  from  our  target  motion  assumptions 
alone.  The  amount  of  increase  achieved  by  a particular  surveillance  plan  is 
a measure  of  the  effectiveness  of  the  surveillance  system  when  deployed  according 
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to  the  specified  plan.  To  illustrate  this  point,  we  compare  the  effectiveness  of 
a number  of  different  surveillance  plans  against  four  hypothetical  targets 
which  are  distinguished  only  by  having  different  prior  probability  distributions. 
The  four  prior  target  location  distributions,  together  with  their  corresponding 
no-surveillance  posterior  distributions,  are  indicated  in  Table  II-2, 

The  surveillance  plans  which  we  consider  here  are  a K-stage -optimal 
surveillance  plan  and  the  three  stationary  surveillance  plans.  Tlie  K-stage - 
optimal  surveillance  plan  has  as  its  objective  function  the  probability  that  the 
target  is  located  in  the  highest  probability  cell.  The  stationary  plans  are 
referred  to  as  the  1-stage  look-ahead  raaximum-information-gain  policy,  the 
3-stage  look-ahead  maximum-information-gain  policy,  and  the  highest- 
probability-cell  policy. 


The  1 -stage  look-ahead  maximum -information-gain  policy,  denoted 
allocates  its  effort  so  as  to  maximize  the  expected  information  content 
of  the  posterior  distribution  after  1 stage  of  surveillance  and  target  motion. 
For  X = (xi,  X2,  X3,  X4)'e^^,  let  t^(K)  be  the  information  content  of  X, 


1’ 


4 

4^(X)  = 2 X.  In  X.  + In  4, 

j=l  J J 

and  let  <p*  be  the  one  stage  optimal  surveillance  plan  for  the  objective  function 
Then  cp*  satisfies 


i 


for  all  1 -stage  surveillance  plans  *(p.  We  then  define  the  stationary  surveillance 
plan  by 
1 

<p  (X,  k)  = <p*Qi.,  1)  for  all  k = 1,  2,  3 

1 

The  3 -stage  look-ahead  maximum -information-gain  plan,  denoted  (Pi^  is 
defined  similarly.  Let  cp*  be  the  three-stage  optimal  surveillance  plan  for 
the  measure  of  effectiveness  The  plan  cp*  then  satisfies 


Note  that,  although  fails  to  satisfy  a uniform  Lipschitz  condition  and  hence 
C,  we  can  still  easily  define  A^(*  ,t^). 
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for  every  surveillance  plan  <p.  We  then  set 


(P  fX,k)  = (p+(X,  1),  k=l,  2,  3 

3 


Thus  <pj.  is  the  stationary  surveillance  plan  which  at  each  stage  allocates 
effort  according  to  the  first  stage  of  <p*. 

Finally,  because  of  its  great  intuitive  appeal,  we  consider  the  highest 
probability-cell  policy.  This  surveillance  plan  allocates  at  each  stage  the 
entire  available  effort  to  one  of  the  cells  which  currently  contains  the  target  with 
highest  probability. 

For  each  initial  target  location  distribution  and  each  surveillance  plan 
under  study,  we  have  graphed,  in  Figures  II-2,  11-3,  11-4,  and  n-5,  the 
probability  of  correctly  specifying  the  target's  location  at  the  end  of  the 
operation  as  a function  of  the  number  of  stages  in  the  surveillance  operation. 

Our  measure  of  effectiveness  is  the  probability  that  the  target  is  located  in 
the  highest  probability  cell.  Observe  that  for  each  K,  as  expected,  the  K-stage 
optimal  surveillance  policy  yields  the  greatest  probabilily  of  correctly  specifying 
the  target's  position  at  the  end  of  the  surveillance  operation.  Similarly,  if  no 
surveillance  is  performed,  \.e  achieve  the  least  amount  of  target  localization. 

The  two  maximum  information  gain  policies  perform  almost  as  well  as  does 
the  optimal  policy,  and  both  perform  better  than  the  highest  probability  cell 
policy. 

In  particular,  in  Case  I (see  Figure  II-2),  the  initial  target  location  probability 
distribution  is  (1,  0,  0,  0)  so  that,  with  probability  1,  the  target  is  initially 
located  in  cell  C Thus,  as  a triviality,  any  surveillance  operation  lasting 
zero  stages  can  always  correctly  specify  the  target  location  at  the  end  of  the 
operation.  For  the  surveillance  operations  considered  here,  however,  the 
probability  of  correctly  specifying  the  target's  location  decreases  if  the  length 
of  the  operation  is  between  1 and  10  stages.  The  reason  for  this  is  the  conflict 
between  target  motion  and  the  applied  surveillance  effort.  From  Table  II-2 
we  see  that,  in  Case  1,  target  motion  during  the  first  ten  stages  decreases 
rapidly  our  information  about  the  target's  position.  During  the  first  ten  stages 
the  available  surveillance  effort  is  insufficient  to  overcome  the  loss  of  target 
localization  caused  by  target  motion. 

If  no  surveillance  is  applied,  observe  also,  from  Table  II-2,  that  after 
about  13  stages  of  target  motion,  the  most  likely  cell  to  contain  the  target  changes 
from  cell  Cj  to  cell  C4.  Additionally,  after  15  stages  of  target  motion  the 
amount  of  target  localization  implied  by  the  motion  assumptions  above  actually 
increases.  This  is  reflected  in  Figure  II-2  by  an  increase  in  the  localization 
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achieved  by  each  of  the  surveillance  plans  after  15  stages. 


One  very  remarkable  aspect  of  Figure  11-2  is  that,  after  13  stages  of 
surveillance  effort,  the  highest  probability  cell  policy  performs  only  slightly 
better  in  localizing  the  target  than  does  no  surveillance  at  all.  The  reason 
for  this  is  that  after  13  stages,  the  target  is  most  likely  to  be  located  in  cell 
C4,  and  so  the  highest  probability  cell  policy  will  allocate  most  of  its  effort 
to  cell  C4,  But  we  have  already  indicated  that  the  sensor  response  matrix 
implies  that  investigating  cell  is  comparatively  unproductive.  Indeed  the 
effectiveness  of  such  a search  is  so  low  that  each  increment  of  effort  adds 
little  to  the  success  probability.  The  K-stage -optimal  surveillance  policy 
and  the  maximum-information-gain  policies  compensate  for  this  by  almost 
never  investigating  cell  C4.  Indeed  the  high  probabilites  of  correctly  specifying 
the  target's  location  attained  by  these  policies  are  due,  in  large  measure,  to 
exploiting  the  differences  in  response  probability  achieved  by  applying  effort 
to  the  various  cells.  These  policies  investigate  cells  Cj^,  C2.  and  C3,  and 
use  a lack  of  sensor  responses  to  localize  the  target  in  cell  C4,  The  highest 
probability  cell  policy,  on  the  other  hand,  incorporated  no  information  about 
the  sensor  response  capability  and  is  thus  independent  of  the  response  matrix  R, 

It  performs  correspondingly  poorly. 

It  is  also  interesting  to  note  in  Figures  11-2.  II-3.  II-4.  and  II-5  that 
both  the  1-stage  and  3-stage  look-ahead  maximum-information-gain  policies 
perform  much  Ijettcr  than  the  highest  probability  cell  policy,  and  nearly 
as  well  as  the  K-stagt-optimal  surveillance  plan.  These  features  are  common 
to  all  of  the  cases  studied  to  date.  A somewhat  surprising  observation 
is  that  the  3 -stage  look-ahead  maximum -information-gain  policy  does  not 
perform  significantly  better  than  does  the  1-stage  look-ahead  maximum- 
information-gain  policy.  Thus,  in  terms  of  stationary  plans,  a well-chosen 
1 -stage  look-ahead  policy  appears  to  be  almost  as  good  as  a policy  which  looks 
further  into  the  future.  We  have  yet  to  find  an  example  which  violates  this 
conjecture. 

Recall  that  the  K-stage-optimal  surveillance  plan  depends  strongly  on  the 
horizon,  and  thus  this  plan  may  be  quite  different  for  different  horizons.  The 
1-  and  3-stage  look-uhead  maximum-informution-gain  policies  are  particularly 
attractive  because  they  depend  only  on  the  present  target  location  probability 
distribution  and  not  on  the  number  of  stages  remaining  in  the  operation. 

.Additionally,  as  illustrated  by  Figures  II-2  through  11-5,  these  policies 
incorporate  sensor  response  characteristics  sufficiently  well  to  give  pro- 
babilities of  success  reasonably  close  to  the  tlieoretical  maximum  determined 
by  the  K-stage  optimal  surveillance  plan. 

One  particularly  important  feature  of  the  ma.ximum -information-gain 
policies  is  that  they  are  not  only  independent  of  the  number  of  stages  in  the 
surveillance  operation,  but  since  they  maximize  the  information  content 
of  the  posterior  distribution,  they  also  do  not  depend  on  the  measure  of  effectiveness. 


-34- 


Moreover,  they  appear  to  give  good  target  localization  for  a number  of  different 
measures  of  effectiveness. 

To  illustrate  this  point,  we  have  graphed  in  Figures  11-6,  II-7,  II-8,  and 
II-9  the  degree  target  localization  attained,  in  each  of  the  previous  four  cases, 
when  the  measure  of  effectiveness  is  the  probability  that  the  target  is  located 
in  the  tu'o  highest  probability  cells.  In  these  figures,  the  K-stage -optimal 
surveillance  plan  is  chosen  so  as  to  maximize,  at  the  end  of  K stages,  the 
expected  probability  that  the  target  is  located  in  the  two  highest  probability 
cells,  llius  the  upper  curve  in  Figures  II-6  through  II-9  indicates  a theoretical 
upper  bound  for  the  amount  of  target  localization  possible  for  this  surveillance 
objective  function.  The  1-  and  3-stage  look-ahead  maximum -information -gain 
policies  and  the  highest-probability-cell  policy  are  exactly  the  same  allocations 
of  effort  used  in  Figures  II-2  through  II -5, 

Note  that,  in  each  case  presented,  the  maximum -information-gain  policies 
yield  near  optimal  two  cell  target  localization.  Also,  after  about  13  stages, 
the  highest-probability-cell  policy  performs  only  slightly  better  than  if  no 
surveillance  effort  was  expended  at  all.  Again  the  reason  for  this  is  that  after 
about  10  stages  the  highest  probability  cell  policy  is  allocating  most  of  its 
effort  to  the  comparatively  uninformative  cell  C^. 

It  is  thus  apparent,  at  least  in  the  cases  at  hand,  that  the  maximum- 
information-gain  policies  provide  robust  estimates  for  target  location  over 
a variety  of  measures  of  effectiveness.  This  conjecture  has  been  verified  in 
each  case  studied  to  date.  This  is  an  important  result  because  such  plans 
are  computationally  easy  to  determine,  and  since  they  require  neither  a prior 
specification  of  the  number  stages  involved  in  the  surveillance  operation  nor 
knowledge  of  the  surveillance  objective  function. 

Finally,  it  is  of  interest  to  observe  the  asymptotic  behavior  of  the  various 
surveillance  plans  as  the  number  of  stages  becomes  large.  In  Figures  II-IO 
tlirou^  11-13,  we  have  graphed  the  functions 

k k 

max  A (</?,  f )(X),  and  min  A (</;,  f )PC) 

XeSo  ^ XeS„  ^ 

o 


for  k 0,  1,  2, . . . ,20,  where  fj^pC)  ^ probability  that  the  target  is  located 
in  the  highest  probability  cell,  and  tp  is  respectively  the  K-stage -optimal 
surveillance  plan,  the  1-  and  3 -stage  look-ahead  maximum -information-gain 
policies,  and  the  highest-probability-cell  policy.  By  definition,  the  range 
of  possible  payoffs  resulting  from  the  surveillance  plans  <j)  lies  between  the 
upper  and  lower  bounds  given  by  the  above  functions.  The  striking  feature  here 
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is  the  apparent  rapid  convergence  in  each  case  to  constant  values  which  depend 
on  the  surveillance  plan  but  not  on  the  prior  target  location  probability  distribution. 
This  indicates  that  precise  knowledge  of  the  initial  target  location  distribution 
is  unimportant  to  the  long  term  ability  of  a surveillance  system  to  localize 
a target.  In  view  of  this  apparent  rapid  convergence  of  the  expected  localization 
to  the  limit  values,  it  is  valuable  to  be  able  to  characterize  the  limit  values 
in  terms  of  the  surveillance  plan,  the  sensor  response  matrix,  and  the  target 
motion  matrix.  Unfortunately  we  have  not  yet  been  able  topush  matters  that 
far.  In  the  next  section,  however,  we  establish  the  existence  of  the  specified 
limit  for  the  K-stage -optimal  surveillance  plans  under  a variety  of  different 
hs^otheses. 


Asymptotic  Behavior  of  the  K-Stage  Optimal  Surveillance  Plan 

One  of  the  most  striking  features  common  to  the  examples  discussed  in 
the  previous  section  is  the  rapid  convergence,  as  K-*oo , of  the  functions 

f)(  ).  Additionally,  it  is  of  interest  to  note  that,  in  all  of  the  previous 
examples,  the  limiting  values 


lim  A^((^,f)(X)  {n-13) 

K-*ro 


depend  only  on  the  surveillance  plan  and  the  measure  of  effectiveness  f and 
not  on  the  initial  target  location  probability  distribution  X.  The  purpose  of 
this  section  is  to  discuss  various  issues  related  to  the  asymptotic  behavior 
of  the  functions  f)(-)  for  K-stage  optimal  surveillance  plans  0^. 

First  note  that  the  limit  (U-13)  need  not  always  exist  for  the  optimal 
surveillance  plan,  and  indeed  it  is  quite  easy  to  construct  counterexamples. 
To  see  this,  consider  a two-cell  surveillance  problem  involving  a completely 
uninformative  response  matrix  R;  for  some  c,  0 < c < 1, 


R 


c 


c 


Also  suppose  that  the  Markovian  target  motion  matrix  M is  given  by 


M - 


0 

1 


1 

0 
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Such  a motion  matrix  interchanges  the  position  of  the  target  between  the  two 
cells  in  a single  stage. 

Suppose  now  that,  for  X = (Xj^,  X2)'  our  objective  function  f is  given 

by 


f(X)  = axj  + bx2 


where  a and  b are  positive  real  numbers,  a;^.  Since  our  response  matrix  is 
completely  uninformative,  any  surveillance  plan  is  optimal,  and  we  will  thus 
assume  that  the  surveillance  plan  is  given  by  (p^(X,  k)  = (^,  1)  for  all 
X and  all  K - 1,  2,  3, . . , , k = 1,  2, . . . , K, 


Using  now  the  definitions  for  the  posterior  distributions  U given  in  (II-l) 
we  see,  for  l = 0,  1,  and  2,  that  ^ 


(X,  K)  = MX,  K = 1.  2,  3, . . . . 

Similarly,  if  0.  (X,  K),  I --  0,  1,  2,  are  the  event  probabilities  defined  in 
equation  (11-2),^  we  have,  for  all  X c Sj,  and  K 1,  2,  3, , . , that 


0^  (X,  K) 


1-c 

if  / = 0 

c/2 

if  Z - 1,  2 

It  follows  then  from  equation  (II-8)  that 


f)  f(M*bf). 


But  note  that  " M and  that  M““  = 1,  K - 1,  2,  3 Thus  if  X = 


+ 1 


.2K 


A^(cp’^,f)(X) 


axj^  + bx2  if  K is  even 
ax2  ^ bxj^  if  K is  odd. 


so  that  the  limit  (III-13)  will  exist  if  and  only  if  X (2,  i). 


y 
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In  spite  of  this  counterexample,  observe  that,  in  the  case  at  hand,  every 
subsequence  of  f)(*)}K“l  contains  a uniformly  convergent  subsequence. 

Similarly  the  Cesaro  limit  of  the  functions  A^(f,  (p){-)  exists  for  X e 
Indeed 


lim  ^ 2 A^(f,</)S(X)  = a + b, 
K-oo  ^ k=l 


Recall  that  the  Cesaro  limit  here  indicates  the  long-term  average  target 
localization  obtainable  using  the  objective  function  f and  the  surveillance 
parameters  R,  M,  and  q)^.  In  Theorems  II-2  and  II-3  below  we  establish 
these  two  facts  for  an  arbitrary  surveillance  objective  function. 

In  the  remainder  of  this  section  we  will  consider  a surveillance  operation 
on  N cells  Cj,  C2, . . . , Cn-  Recall  that  our  space  of  objective  functions  C 
is  the  space  of  all  convex  functions  which  are  imiformly  Lipschitz 

continuous  on^N-1-  assume  that  each  function  f c C has  been  extended 
to  a function  defined  on  the  positive  orthant  Vjj  of  by  setting 


1 


( (ip^  ) ItYeV^,  V/O 

t(Y)  ] V ' / 

0 if  Y - 0. 


Recall  that  if  f c C then  there  exists  L > 0 such  that,  for  all  Y , Y e V 

X ^ 

!f(Yj)  -ffY^)!  <L  II  - Y^llj. 

1^ 

Theorem  11-2.  Let  q?  , K 0,  1,  2,  3, ...  be  the  K-stage  optimal  surveillance 
plans  for  the  objective  function  f e C.  Then  the  family  {A^((/>^,  f)(0}K  0 
equicontinuous  family  of  functions  and  so  every  subsequence  from  this  family 
contains  a uniformly  convergent  subsequence. 

Proof.  Suppose  our  sensor  response  matrix  is  R and  that  our  target 
motion  matrix  is  M.  Choose  a constant  L > 0 such  that  if  Xj,  X2  c Vj,j  then 


f(X^)  -fpj^)!  <L 


X1-X2I 
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It  then  follows  from  Theorem  II-l  that  if  Xj,  'X2  € 


1 


I 


r 

i 


i 


[ 


I AN/,f)(Xj)  - f)(X2)(  <L  ||Xj  -xjl^ 

and  so  the  family  {A^((/)^,  f)}  is  clearly  equicontinuous. 

As  an  immediate  consequence  of  this  theorem  observe  that  every  convergent 
subsequence  of  {A^((/)^,  f)}  is  in  fact  uniformly  convergent. 

The  following  theorem  establishes  the  existence  of  the  Cesaro  limit  in 
general.  Note  that  this  theorem  is  valid  for  every  sensor  response  matrix  R 
and  every  target  motion  matrix  M.  The  theorem  states  that  there  is  a long 
term  average  expected  target  localization  for  every  prior  target  location 
distribution. 

Ir 

Theorem  II-3.  Let  </>  , k = 0,  1,  2, . . . be  the  k-stage  optimal  surveillance 
policy  for  the  objective  function  f e C.  Then  the  sequence  of  functions 

r 1 5 .k  k ,co 

^ K ’^^K=l 

is  uniformly  convergent. 

To  establish  this  result  we  need  first  the  following  lemma. 

Lemma.  Let  cp  be  the  stationary  surveillance  on  N cells  given  by 


^j(X,k)  (1/N,  1/N, 


1/N)  X € S^_^,  k - 0.  1,  2, 


If  f e C then  the  sequence  of  functions 


f)} 


CO 

K-1 


is  uniformly  convergent. 

Proof  of  Lemma.  We  first  show  that  the  family  { A^((p,  f)}  is  an  equicontinous 
family.  Choose  a constant  L such  that 


J 


We  will  show,  by  induction  on  k,  that 


I A^fXX^)  - aV,£)(X2)|  <l  ||x^ 

The  result  is  clear  for  k 0.  Using  equation  (11-8),  we  obtain 

I If  + 1 If  + 1 I 

I A*"  \<^,f)(X^)  -A  (<p,f)(X2)| 

< ^ V I a‘'(<^.  f)(MT^X^)  - a\cp,  f)(MT  Xg)  | 

+ |AN,^.£)[Ma  X T^)X^)- AV,f)[M(I-^  S 
/ ~ 1 2 ” 1 


It  follows  now  from  our  induction  hypothesis  that 


A^^\</^,f)(X^)  - A^^^<P.f)(X2)|  <L{^  S ||MT^(X^  -X^lllj 

1 


. ||M(I-i  rT,)(X,-X^)||j} 


It  is  easy  to  see  that,  for  any  vector  Y c K^,  1 1 1 1 ^ ^ 1 1 Y 1 1 whence  we  h?ve 

t If  + 1 If+l  I 

I A («7,  f)(Xj)  - A (</>,  OfX^)  I 


N 


Now  recall  that  T,  diag(ri;  , rg;  , r,,,,  ) and  let  (y,,  y^, . . . , y^)  X - X„. 
Since  0 < rj^  < 1,  it  follows  that  ' ^ ^ ^ 
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Thus  we  conclude  that 


|A‘'^\cJ,f)(X^)  - A^'VfXX^)!  < !■  ||X^  -X^ll^, 


and  so  {A  (y?,  f)}  is  an  equicontinous  family. 


Now  set 


1 K k 

G (f)(X)  - Z A (c>,f)(X). 

k=l 


The  family  of  functions  {Gj^(f)}  is  clearly  also  an  equicontinous  family,  and 
so  every  subsequence  contains  a uniformly  convergent  subsequence.  We  thus 
need  only  show  that  every  convergent  subsequence  from  the  family  {Gj,^(f)} 
converges  to  the  same  limit 


Suppose  now  that  the 
convergent.  Set 


CO 

sequences  {Gj.(^  (f)}  . _ and  {Gj-  (f)}  . 

h 32  ^2^ 


are  uniformly 


» r-/-  { g € C : { G (g)}“  and  { G (g)}  are  uniformly  convergent  sequences}, 

j,  •’l  ^2 


Clearly  f c ,,  f/.  and  non  void. 

For  each  ^ deiine  hinctionals  and  on» 


H^(g)  Um  Gj^  (g),  and  H^fg)  lim  G^^  (g), 

ii  39“*®  ^2 


For  g f t^thc  functions  Hivg)  H2te)  conve.x  uniformly  Lipschitz 
continuous  functions  on  S^:_^  and  hence  in  C.  Additionally,  for  each 
X f H2(-)(X)  are  linear  functionals  on» 


Define  now  an  operator  on  the  space  fjs/as  follows:  for  each  g c set 


1 N 1 N 

^(g)(X)  ^ ^ g(MT  X)  + g(Ma  - X T )X). 

X ^ 1 I ^ / 1 ‘ 


Note  that  <1>  is  a linear  operator  on^fl/and  that 

k k / 

A (cAg)  - 4>  (g)  ^•••^(g),  gcer/. 

We  now  claim  that  <&(g)  <" , r-y  whenever  g < ,jd . Clearly,  $(g)e  C.  and  we  need  only 

show  that  the  setiuences  {G[^.  (‘{>g)}“  , are  uniformly  convergent.  But,  for  i 1,  2, 

ii 


3 

■] 


Gr  (^g)  - Gr  (g)  ^ ^ [A  {<p,g)  - A^cNg)). 


Thus  <f>(g)  and  indeed  n|(g)  Ilj($g),  i 1,  2,  It  follows  then  that 

Gj,^(g)c,V,  fori  1,  2,  3,.,.  . 

We  also  claim  that  Hj(g)  c and  indeed  that  Hj  is  a continuous  operator 
off  into  itself.  To  see  that  H|(g)  whenever  g t tj(/  , note  that  since  Gj.;.  (g) 
converges  uniformly  to  H|(g)  it  follows  from  the  linearity  of  <!>  that  *Gj,^  (g)  ^ 
converges  uniformly  to  $H^(g),  But  ^Gk-  (g)  - Gj^;-  ($g),  and  so  Ji 

<f.Hi(g)  - Hi(<&g)  lli(g);  whence,  GK(Hi(g)y  “ Hi(g),-'^  i = 1.  2,  K - 1,  2,  3 

Thus  clearly  Hj^(g)  t 

Ixjtf^have  the  tojjology  induced  by  the  supremum  norm.  To  see  that  H. 
is  a continuous  mapping  off  .^into  itself,  let  gl,  g2  t xjd  satisfy 


!|g 


1 


sup  I g,(X)  - g (X)  I < 

Xc//' 

'^N-1 


Note  that  if  X f then  | gj(X)  - g2(X)|  < ( f /3) ! | X 1 1 so  that 


'^-fg^XX)  - 4>(g2)W!  < |, 


L 


It  follows  then  that 


3 ’ K 1,  2,  3. 

Now  choose  so  large  that  if  Kj^  > K^,  then 


Using  the  triangle  inequality  it  follows  immediately  that 


H.(gj)-H.(g2)llc<  c 


and  so  as  claimed  is  a continuous  map  oftJj/into  itself. 
Now  observe  that 


H,(f)  H,(G,.  (f))  Urn  H (G  (f))  = H (H  (f)) 

' ' No  ^2—  No 


since  H.  is  continuous  on  eft.  Similarly 
i 


Hi(f)  = (H^(f))  = H^fH^lf)). 


Interchanging  the  role  of  and  II2  above  we  obtain 

H^(f)  - H^Hgff)  - H2Hj(f)  - H2(f) 


and  so  every  convergent  subsequence  from  the  family  has  the 
limit,  as  claimed.  This  completes  the  proof  of  the  Lemma. 


1 

.4 


) 


same 
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Proof  of  Theorem  11-3.  For  k 1,  2,  3, , . , let  0 be  the  k-stage  optimal 
surveillance  plan  for  the  measure  of  offr>ctiveness  function  f.  Recall  that 
each  is  determined  entirely  by  the  number  of  stages  remaining  in  the 
surveillance  operation,  so  that,  for  all  X e 

k^  - Z)  = ^2  - / ) Z = 0,  1,  2, . . . min  {k^.k^}. 

For  each  j ^ 1,  2,  3, . . . let  be  the  k-stage  surveillance  plan  defined  by 


!*' jk(X,Z) 


(X,  Z)  if  k < j 

/L  1 Iv 

N’  N’‘*'’N  ^ 

(/)j(X,Z-k+j),  Z k-jfl k ifk>i. 


Thus  if  k > j,  allocates,  during  the  first  k~j  stages  of  the  surveillance 
plan,  the  same  amount  of  effort  to  each  cell.  During  the  remaining  j stages 
of  the  plan,  ;/  allocates  effort  according  to  the  optimal  surveillance  plan  (/)j. 


Now  define  an  operator  $ : C C by 


❖ (f)(X)  - ^ V f(MT  X)  + f(M(I  - ^ X 
^1=1  I ^11 


1 N , 

N 2:  T^(X)). 


It  is  easy  to  see  that 


A ((  ,,.0 


aN/.O, 


k=  1,  2,.. 


(A^((/?^f)).  k j+1,  j^2. 


Whence  if 


1 .k 


^K.  K ^ k 1,  2,  3 
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It  follows  that  the  sequence  uniformly  to  a function  H-e  C, 

j 1,  2,  3 ^ ~ 

Suppose  now  that  < j2-  follows  then  that 

j .f)  - 
•’^2 


■’2-2 


In  particular,  for  < k,  we  have,  by  induction  on  k,  that 


A (!/'.  . ,f)  < A (rf  : ^.f) 

X M 


whence  Hjj^  < Uj^.  Since  H.  < (IfHoo  I I it  follows  that  the  sequence 

H j converges  uniformly  to  a%unction  He  C.  N-1 

Consider  now  the  diagonal  sequence  of  functions 


1 .k. 


K 


.k.  k 


^KK  K ^ ^‘''K,k’^  K A ((P  ,f). 


It  is  easy  to  see  nf>w  that  this  sequence  converges  uniformly  to  H and  we 
have  thus  obtained  the  desired  result. 

The  following  two  theorems  establish  limit  results  for  the  sequence  of 
functions  f)}j?  under  various  hypotheses  concerning  the  target  motion 

matrix.  Ibc  first  theorem  is  most  useful  in  the  case  of  a stationary  target, 
for  then  the  target  motion  transition  matrix  is  the  identity  matrix.  The  second 
theorem  is  applieatde  to  many  moving  target  situations,  in  particular  to 
those  presented  in  the  previous  section. 

'Fheorem  11-4.  Let  , K = 0.  1,  2. . . . be  the  K-stage-optimal  surveillance 
plan  for  an  objective  function  f e C.  Let  M be  the  target  motion  transition 
matrix  and  su^jpose  that  I’  {X  t MX  X}.  Then  the  sequence 

J converges  uniformly  on  U. 


Proof.  Let  X e U^;  then 


\f)(X)  > A^/.f)(MX) 


>AV,f)(X). 


K K 

and  so  the  sequence  { A ((;^)  , increasing  sequence  and  so  uniformly 

convergent  on  U. 

Theorem  11-5.  Let  (p  , K = 0,  1,  2, ...  be  the  K-stage  optimal  surveillance 

plan  for  the  objective  function  f e C.  If  the  target  motion  matrix  M has  a row 

with  no  zero  entries  and  if  every  entry  of  the  sensor  response  matrix  is  less 

than  1,  then  the  sequence  {A^(<^^,  f)}®  - converges  uniformly  to  a constant. 

K-  X 

Proof.  Let  M [d^.]  and  suppose  that  dj^j  >0  >0,  j = 1,  2, . , , , N. 

If  X = (xi, . , . , xn)  and  Y = (yi, . . . , y^^)  t consider 


||mX-My!|  = 2|s  d (X  -y  ) 
i i j 1]  1 3 


Observe  that 


Id.  .(x.-y.)  - L (d.  .-aXx.-y.) 
3 o ■'  3 o^ 


< 2 (d.  .-a)  x.-y. 

~ j V ■’  ^ 


< 0 2 d.  . x.-y. 

■ J V J J 


for  some  0,  0 < 0 < I,  whence 


mx-my|L  <2(i-d.  .+  ed.  .)  I x.-y.  I 

1 - j 1 j 1 j ' j j' 

< (l-(l-0)a)||X-Y||^. 


But  1-(1-  0)a  < 1 so  that  M is  a contraction  mapping  of  llnis  there  is 

a unique  such  that 


lim  X , allX 

o N-1 

K—  00 


Now  observe  that  MX  X , whence  from  equation  (11-12)  we  obtain 


A^^\/^\f)(X^)  > AV,f)(MX^)  = A^</.^0(X^). 


K K 

Thus  the  sequence  of  numbers  [ A (</>  , 0(X^)}  converges. 
We  will  now  show  that 


lim  sup{  I A^(<^^,f)(X)  - AV,f)(Y)  : X,  Y 0. 

K-*oo 


To  see  this,  define  for  K ^ 1,  2,  3,. . . numbers  Lj^  such  that 


A^</p^f)(X)  -AV.f)(Y)  ! <Lj^  1|X-y||^,  X,  Y e V^. 


The  existence  of  the  numbers  was  established  in  Theorem  ll-l.  Suppose 
now  that 


A'^^\<p'^'^\f)(X)  = A^(/f)(MT^X)  + AV^f)(M(l-T^)X) 


> A^'^(^‘^'■^f)(Y). 


We  then  have 


< I A^/.f)(MT^X)  - AV.f)(M(T  Y)| 


+ I A^(/.f)(M(I-TpX)  - A^<^^f)(M(I-T^  )Y) 


< Lj^  { 1 1 MT  (X-Y)  1 1 ^ + 1 1 M(I-Tp{X-Y)  1 1 ^} . 


But  if  R = [Tjj],  then  by  our  assumption  on  R,  there  is  a /5  > 0 such  that 


d (l-r,)>;3>0,  ,=  1.2.3 N. 

O J 

We  then  obtain,  as  above,  for  some  appropriately  chosen  p,  0 < p < 1, 


||m(I-T^)(X-Y)  11^  2|zd..(l-r^  )(x,-y.)| 


< fiZ  (1-r  )(x  -y  ). 
j j ^ ^ 


Again  using  our  assumption  that  r^  < 1,  we  have 

,i 


|a  - A*^  \f)(Y)|=  V l/x+r  (l-/u)J  ( x.-y.  I 

i ^ i J J 


, k+1,  k+1 


i 


< l|X-Y||, 


for  some  appropriately  chosen  constant  e , 0 < e <1.  Observe  that  o is 
independent  of  K,  whence,  by  induction  o o o 


Lk<V  (K-®). 


4. 


Now  since  the  sequence  of  numbers  {A  f)(X^}“  is  convergent  and 
e Li,.-*0,  (K— °o),  it  follows  as  claimed  that  is  uniformly 


since  L, 


convergent  to  a constant. 


An  Important  Special  Case:  The  Homogeneous  Sensor 


In  this  section  we  describe  an  important  special  surveillance  sensor 
whicn  is  similar  to  that  encountered  in  a number  of  operational  situations. 

Consider  a sensor  which  is  only  able  to  detect  a target  if  it  investigates 
the  cell  which  contains  it.  Occasionally,  however,  the  sensor  will  produce  a 
non-target-generated  contact.  These  contacts  may  occur,  for  example,  because 
of  random  acoustic  fluctuations,  electronic  instabilities,  or  sensor  operator 
error. 

Suppose  that,  if  the  sensor  investigates  the  cell  which  contains  the  target, 
the  probability  that  the  sensor  will  obtain  a target-generated  contact  is  fi . 
Additionally,  suppose  that  the  probability  of  obtaining  a non -tar  get -generated 
contact,  p . is  independent  of  the  cell  investigated. 

We  seek  the  sensor  response  matrix  for  the  case  at  hand.  Since,  if  we 
search  in  a cell  which  does  not  contain  the  target,  the  only  possible  sensor 
response  must  be  non-target  generated,  it  follows  from  the  definition  of 
r^j  that  rij  - p when  i/].  On  tlie  other  hand,  if  we  investigate  the  cell  which 
does  contain  the  target.tlien  we  assume  a sensor  response  may  te  obtained 
in  two  mutually  exclusive  ways;  either  a sensor  response  may  be  target 
generated  or  it  may  be  non-target  generated.  Setting  \ /i  * p.  it  follows 
that  rii  A , i 1,  2. . , . , N.  The  sensor  response  matrix  is  thus  the  N x N 
matrix 


A p p ...  p’ 


p A p ...  p 


H p p A 


P P 


Such  a sensor  is  called  a homogeneous  surveillance  sensor  with  parameters 
(A,p).  \ > p.  The  quantity  \ -p  is  the  target-generated  contact  rate  and  p 
is  the  non-target-generated  contact  rate.  In  terms  of  valid  contacts  (i.  e. , 
contacts  coming  from  the  cell  containing  the  target  I’egardless  of  whether 


they  are  target  generated),  X is  the  valid  contact  rate  and  n is  the  false  contact 
rate. 


For  I 1,  2, . . . , N we  now  compute  U (X,  k),  the  posterior  target  location 
probability  distribution  at  the  end  of  stage  k given  that  the  location  distribution 
at  the  beginning  of  stage  k is  X = (Xj, . . . ,Xj^)  and  that  there  was  a sensor 

resDonse  from  cell  . Let  e^  = (0,  0, . . . , 0,  1,  0 0),  where  the  1 occurs 

in  the  coordinate.  It  follows  then  from  equation  (11-1)  that 


(X-p)x^ 

e + X . 

(X-p)x^  + p I (X-p)x^  + p 

and  it  is  instructive  to  interpret  the  terms  on  the  right-hand  side  of  this  identity. 
Observe  that 


(A-m) 

(X-/i)x^  + M 

is  the  conditional  probability,  given  that  a contact  occurred  from  investigating 
cell  , that  the  contact  was  target  generated;  also  e/  is  the  posterior  target 
location  probability  distribution  given  that  the  contact  was  target  generated 
(for  then  the  target  must  be  in  cell  ).  Also, 


(X-H)x^  + H 

is  the  conditional  probability,  given  that  a contact  occurred  from  investigating 
cell  Cl , that  the  contact  was  non-target  generated.  In  this  case,  the  posterior 
target  location  probability  distribution  is  X,  the  same  as  the  prior,  since  no  new 
information  has  been  gained. 

Similarly,  we  can  compute  UgfX,  k),  the  posterior  target  location  at  the 
end  of  stage  k given  that  the  prior  distribution  at  the  beginning  of  stage  k is  X 

and  that  there  were  no  sensor  responses  during  stage  k.  For  j 1,  2 N, 

let  c>j(X.  k)A  be  the  amount  of  surveillance  effort  allocated  in  cell  C during 
stage  k,  and  set  k ^ (X-  /u)  (1  -//).  We  then  obtain,  from  equation  (II-l),  that 


(X, k)  = M 
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4 


Ujj(X,k)  = M 


N 

j=l  <;>j(X.k)[X-KXjej] 

N 

.2  ^ p{.k)[l->cx  ] 
J-1  J i 


and  the  probability  of  obtaining  no  sensor  response  is 


(1-h) 


N 

2 (X,k)[l-KX.] 

j 1 J J 


Consider  now  a surveillance  sensor  with  the  response  matrix 

"c  0 0 ...  0~ 

0 £ 0 ...  0 

0 0 e ...  0 . 


i_0  0 0 ...  ej 

Such  a sensor  is  a homogeneous  surveillance  sensor  with  parameters  (e  , 0).  Every 
conbict  for  such  a sensor  is  target  generated,  and  the  occurrence  of  such  a contact 
completely  localizes  the  target  to  a single  cell.  If  for  j 1,  2,  . . . , N,  k)A 

is  the  amount  of  surveillance  effort  allocated  to  cell  Cj  during  stage  k by  such 
a sensor,  the  posterior  distribution  at  the  end  of  stage  k given  no  contact  is 


N 

2 (X,k)  pC-<  X e 1 

j 1 J J J 


N 


i: 

It  follows  now  that  a homogeneous  surveillance  sensor  with  parameters 
I (A . h)  achieves  the  same  posterior  distribution  given  no  contact  as  does  a 

[ homogeneous  surveillance  sensor  with  parameters  {(X  - p)/(l  -p),  0)  when 

f both  are  used  in  the  same  surveillance  plan.  Moreover  the  ratio  of  their 

(probabilities  of  obtaining  no  contact  is  1-p. 

The  K-Stage  Optimal  Surveillance  Plan  for  a Homogeneous  Sensor 

Consider  a homogeneous  surveillance  sensor  with  parameters  (X,  p),  X> 
which  is  attempting  to  localize  a target  located  in  one  of  N cells  C2, .... 
Suppose  also  that  the  target  motion  matrix  M is  given  by  the  circulant  matrix 
of  the  form,  with  0 < 6 <1, 


i 


i 


i 


M 


N 


6, 

N 


N 


N 


j6 

N 


_5 

N 

N 

N 


1 A 1 

N N N ~ N 


Such  matrices  were  originally  considered  as  target  transition  matrices  for 
surveillance  problems  by  Richardson,  reference  [a]. 


The  parameter  5 is  called  the  dispersion  constant  of  the  target  motion  \ 

matrix.  Observe  that  if  6 -=  0,  then  the  target  is  stationary,  whereas  if 
6 1 the  target  distribution  disperses  to  the  uniform  distribution  in  one 

step.  Additionally  if  X = (xj, , . . , Xj^)'  is  the  initial  target  location  pro- 
bability distribution,  assuming  that  no  surveillance  effort  is  expended, 
the  posterior  target  location  distribution  after  k stages  is 


N ’ 


In  particular,  if  6 >0,  the  target  location  probability  distribution  converges 
uniformly  to  the  uniform  distribution. 
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a.  o 


1 


The  optimal  whereabouts  search  introduced  by  Kadane  in  reference  [b  ] 
is  the  special  case  where  X > 0,  p - 0,  ^ 0,  and  the  surveillance  objective 

luiiction  is  the  probability  that  the  target  is  located  in  the  highest  probability 
cell.  ITius,  in  the  situation  considered  by  Kadane,  the  target  is  stationary 

and  a sensor  response  can  occur  only  if  the  target  is  located  in  the  cell  being  < 

investigated  (i.e. , there  are  no  false  contacts).  Kadane  was  able  to  show' 
that  the  optimal  allocation  of  surveillance  effort  in  an  optimal  whereabouts 

search  is  always  to  investigate  the  second  highest  probability  cell.  See,  ; 

for  example,  section  4. 4 of  reference  [c  ].  Note  in  particular  that  this 

optimal  allocation  depends  only  on  the  current  target  location  probability 

distribution  and  not  on  the  horizon,  and  is  thus  a stationary  surveillance 

plan.  Because  of  this,  the  resulting  surveillance  plan  yields  uniformly 

optimal  probabilities  of  localizing  the  target  for  any  possible  horizon. 

We  now  view  the  K-stage -optimal  surveillance  plan  for  the  homogeneous 
sensor  and  circulant  target  /notion  matrix  as  a generalization  of  the  optimal  I 

whereabouts  search.  Remarkably  enough,  our  numerical  results  appear  to  I 

indicate  that  the  optimal  allocation  of  surveillance  effort  when  g > 0,  6 > 0,  | 

and  the  objective  function  is  the  probability  that  the  target  is  located  in  the  j 

Highest  probability  cell  is  the  same  as  for  Kadane' s optimal  whereabouts 

search.  Thus  for  any  homogeneous  surveillance  sensor,  when  used  against  I 

a target  whose  motion  matrix  is  circulant.  we  conjecture  that  the  K-stage-  | 

optimal  surveillance  plan  is  the  stationary  plan  which  allocates  the  available  | 

effort  at  each  stage  to  the  second  highest  probability  cell.  | 

Another  possible  generalization  of  Kadane's  optimal  whereabouts  search  I 

is  to  consider  a different  surveillance  objective  function.  Suppose  for  example 

that  we  have  a homogeneous  surveillance  sensor  with  parameters  (X,g).  i 

A > p > 0,  which  is  attempting  to  localize  a target  whose  stochastic  motion  i 

matrix  is  circulant  with  dispersion  constant  6 > 0.  If  the  surveillance  problem 

involves  N cells,  C2, . . . we  can  consider,for  1 < n < N-l,the  surveillance 

obiective  function  given  by  the  probability  that  the  target  is  located  in  the  n 

highest  probability  cells.  In  this  case,  our  numerical  results  indicate  that 

the  K-stage -optimal  surveillance  plan  is  the  stationary  plan  which  allocates 

at  each  stage  the  entire  effort  to  the  (n+l)th  highest  probability  cell. 

On  the  basis  of  these  numerical  results,  we  conjecture  that  in  the  general 
case  of  a homogeneous  surveillance  sensor  and  a target  whose  stochastic  • 

motion  matrix  is  circulant,  the  K-stage-optimal  surveillance  plan,  relative 
to  the  objective  function  which  gives  the  probability  that  the  target  is  located 
in  the  n highest  probability  cells,  is  the  stationary  plan  w'hich  allocates  at 
each  stage  the  available  effort  to  the  (nel)th  highest  probability  cell.^  If  tliis 
conjecture  is  established  theoretically,  it  will  have  a number  of  important 
consequences.  First  note  that  such  a surveillance  plan  depends  only  on  the 
current  target  location  probability  distribution  and  not  on  the  number  of  stages 


Sincf'  thi.s  ha.s  been  written, 
foi  n 2 by  J.  It.  Weisinger. 


a counterexample  to  this  conjecture  has  been  found 
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in  the  operation.  Tliis  plan  would  then  result  in  uniformly  optimal  probabilities 
of  localizing  the  target  for  each  possible  choice  of  horizon.  Moreover,  we  feel 
that  it  is  reasonable  to  model  many  operational  situations  with  a homogeneous 
sensor.  Since  in  this  case  the  K-stage-optimal  surveillance  plan  would  give 
optimal  results  for  every  horizon,  it  has  potential  for  widespread  applications. 

In  order  to  understand  this  conjecture,  recall  that  the  goal  of  surveillance 
is  target  localization  rather  than  target  detection.  The  surveillance  objective 
function  defines  what  is  meant  by  target  localization.  Thus  for  example  if 
we  need  only  localize  the  target  to  n out  of  N cells,  then  the  appropriate  objective 
function  is  the  probability  that  the  target  is  located  in  the  n highest  probability 
cells,  for  if  we  use  our  surveillance  sensor  so  as  to  maximize  this  quantity, 
we  will  maximize  the  chance  of  localizing  the  target  to  the  specified  number 
of  cells.  If  we  now  allocate  our  surveillance  effort  at  each  stage  to  the  (n+l)^^ 
highest  probability  cell,  we  are  effectively  using  our  effort  to  minimize  the 
chance  that  our  best  estimate  of  n cells  which  contain  the  target  is  wrong. 


Examples  Involving  a Homogeneous  Surveillance  Sensor 

Suppose  that  a homogeneous  sensor  with  parameters  (X,  p)  is  used  against 
a stochastically  moving  target  located  in  one  of  four  cells  Ci.  C2,  C3,  C4. 

We  assume  that  the  target  motion  matrix  is  a circulant  matrix  with  dispersion 
constant  6 . 

In  Figures  11-14  and  11-15  we  compare  the  effect  of  the  dispersion  constant  5 
on  the  K-stage-optimal  surveillance  plan  using  a homogeneous  sensor  with 
parameters  (.  1,  . 01),  In  Figure  11-14  the  initial  target  location  probability 
distribution  is  (.  7,  .1,  .1,  ,1).  Observe  that,  in  the  case  of  a stationary  target 
(6  =0),  the  probability  of  specifying  the  target's  location  to  a single  cell  increases, 
as  surveillance  effort  is  expended, monotonically  to  1.0.  On  the  other  hand, 
if  6 > 0 then 

Urn  (.25,  .25,  .25,  .25). 

k-*<D 


so  that  if  no  surveillance  is  performed  the  target  distribution  aymptotically 
becomes  uniform.  Again,  the  parameter  6 determines  how  rapidly  this 
convergence  occurs  and  if  5 - 1,  then  the  target  location  distribution  disperses 
to  a uniform  distribution  after  a single  stage.  This  is  illustrated  in  Figure  11-14 
by  the  fact  that,  if  6 =1,  the  probability  of  correctly  specifying  the  target's 
location  after  one  or  more  stages  of  surveillance  is  .25.  Tlie  other  two  cases 
considered  correspond  to  dispersion  constants  of  .2  and  . 4. 
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2)  C'ii'cuUnil  target  m >ti '.n  matrix  with  dispernion  eanslant  6 


All t)il)(‘t'  , if  stage!'  in  surveillance  o|h  rati  ni 


I 

6 

r 
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In  Figure  11-15  we  have  graphed  the  influence  of  target  motion  on  the 
K -stage -optimal  surveillance  plan  usLif,  thu  same  sensor  parameters  as  in 
Figure  11-14  but  assuming  that  the  prior  probability  distribution  is  (.  3,  . 3,  2,  . 2). 
The  cases  considered  correspond  to  dispersion  constants  of  6 = 0 (stationary 
target),  6 2,  6 .4,  and  6 1 (complete  dispersion  in  a single  stage). 

FinaLy,  in  Figures  11-16  and  11-17,  we  indicate  the  influence  of  the  false 
contact  rate  on  the  K-stage -optimal  surveillance  plan.  We  consider  three 
cases  involving  homogeneous  surveillance  sensors  with  parameters  (X,p) 
where  X . 1 and  p takes  the  values  ,001,  ,01,  and  . 05,  respectively.  We 
also  suppose  that  the  target  motion  matrix  is  a circulant  matrix  with  dispersion 
constant  6 = .2.  In  Figure  11-16  we  assume  that  the  prior  target  location 
probability  distribution  is  (.  7,  .1,  .1,  . 1)  whereas  in  Figure  11-17  it  is 
(.3,  .3,  .2,  .2). 


I Conclusion 

IOur  investigations  have  established  the  K-stage -optimal  surveillance  plan 
as  the  solution  of  a particular  dynamic  programming  problem.  Unfortunately 
r solving  this  dynamic  programming  problem  generally  requires  a tremendous 

[ amount  of  computational  effort.  Moi'eover.  except  perhaps  in  the  case  of  a 

I homogeneous  surveillance  sensor,  the  K-stage -optimal  surveillance  plan 

i depends  strongly  on  the  time  horizon.  Thus  in  general  there  exists  no  uniformly 

^ optimal  surveillance  plan. 

Our  numerical  examples  have  shown  that  the  1-stage  look-ahead  maximum- 
i information-gain  plan,  in  addition  to  being  easily  determined,  is  a good  suboptimal 

surveillance  plan  over  a wide  variety  of  surveillance  objective  functions.  Moreover, 
since  this  plan  is  a stationary  plan,  it  can  be  trivially  extended  to  a surveillance 
plan  of  arbitrary  length. 

Other  surveillance  policies,  such  as  the  highest-probability-cell  policy, 
can  provide  substantially  less  target  localization  thaji  either  the  K-stage -optimal 
surveillance  policy  or  the  1-stage  look-ahead  maximum-information-gain  policy. 

The  reason  for  this  is  that  the  highest  probability  cell  policy  ignores  all 
knowledge  concerning  the  response  characteristics  of  the  surveillance  sensor 
as  well  as  knowledge  concerning  the  target's  motion.  Thus  the  highest  probability 
cell  policy  may  allocate  its  effort  to  an  unproductive  cell,  with  a corresponding 
penalty  in  target  localization. 

Our  a.symptotic  results  have  established  the  existence  of  the  limiting  average 
[layoff  for  the  K-stage -optimal  surveillance  plan.  Additionally,  under  certain 
assumptions  concerning  the  target  motion  matrix,  the  expected  target  localization 
achieved  tiy  the  K-stage -optimal  surveillance  plarr  converges  to  a limit  which 
is  indeiiendent  of  the  initial  target  location  distribution.  Thus,  in  tliis  case, 
the  K -stage -optimal  surveillance  plan  asymptotically  produces  estimates 
for  the  target's  location  wliich  are  robust  against  errors  in  the  prior  distribution. 
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Number  of  stiif^cs  in  siirvei  1 lance  ofx'iatijn 


= number  of  stages  in  surveillance  operation 


Similar  results  are  apparently  valid,  from  our  numerical  examples,  for  stationary 
surveillance  plans. 

Another  consequence  of  our  numerical  examples  has  been  to  show  that  if 
a homogeneous  surveillance  sensor  is  deployed  against  a target  whose  motion 
matrix  is  of  a special  type,  then  the  resulting  K-stage -optimal  surveillance 
plan  is  stationary  and  of  a particularly  simple  type.  For  example,  if  the 
suiweillance  objective  is  to  localize  the  target  to  k cells,  the  optimal  plan 
appears  to  be  to  investigate  that  (k+1)*'^  highest  probability  cell. 


-(i8- 


A STATISTICAL  MODEL  FOR  PROCESSING  ASW  CONTACT  INFORMATION 
TO  ESTIMATE  TARGET  PATTERNS  OF  OPERATION 


In  this  chapter  we  consider  two  major  problems  which  an  ASW  planner  must 
frequently  face  in  the  presence  of  sparse  contacts  of  various  types  and  quality. 

The  first  is  to  obtain  an  estimate  for  the  track  of  a specified  target,  and  the 
second  is  to  make  inferences  about  overall  target  behavior  patterns  on  the  basis 
of  contact  data  from  a number  of  different  targets.  The  purpose  here  is  to 
describe  a Bayesian  method  for  obtaining  both  of  these  types  of  estimates. 

Unfortunatcl}-,  the  methods  in  this  chapter  have  not  yet  been  developed  to  the 
point  where  numerical  examples  have  been  computed,  and  so  none  are  included. 

Our  approach  is  based  on  a parametric  model  for  target  motion.  The  object 
is  to  use  the  available  contact  data  to  obtain  posterior  estimates  for  the  target's 
track  as  well  as  the  parameters  which  describe  target  motion.  A major  consideration 
in  what  follows  is  the  development  of  a parametric  model  for  target  motion  which 
is  rich  enough  to  model  real  world  situations  but  which  is  also  computationally 
tractable. 

The  approach  outlined  below  is  most  applicable  to  the  case  of  a target  in 
transit.  Additionally,  since  we  are  obtaining  Bayesian  estimates  we  are 
required  to  assume  the  existence  of  patterns  of  motion  for  which  we  have  a 
reasonable  prior  knowledge.  This  prior  knowledge  may  take  the  form  of  past 
operational  experience,  or  may  be  a consequence  of  certain  operational  or 
geographical  constraints. 

In  the  first  section  of  this  chapter,  we  describe  our  parametric  model  target 
motion.  The  model  is  based  on  the  notion  of  a number  of  different  target  scenarios 
or  basic  target  tracks.  It  is  assumed  in  this  section  that  the  parameters  for 
the  motion  model  are  completel ' known. 

In  the  second  section  we  suppose  that  the  parameters  for  the  target  motion 
model  are  unknown  but  that  prior  information  about  their  possible  values  has  been 
quantified  in  the  form  of  a probability  distribution.  This  probability  distribution 
is  the  key  item  required  to  perform  the  Bayesian  updating  for  the  parameters  of 
the  target  motion  model. 
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Wo  describe  in  the  third  and  fourth  sections  the  calculations  necessary  to 
perform  the  Bayesian  updating  on  the  pi\^bability  distributions  for  the  target 
motion  parameters  so  as  to  account  for  the  information  provided  by  a completely 
known  target  track.  Although  it  is  unlikely  that  such  high  quality  information 
would  ever  be  available,  this  is  the  simplest  case  to  consider,  and  the  calculations 
are  suggestive  of  the  direction  taken  in  subsequent  sections. 

The  fifth  section  describes  our  model  for  target  contacts.  We  assume  that 
each  contact  provides  us  with  an  estimate  for  the  target's  actual  position  together 
with  a covariance  matrix  which  represents  the  uncertainty  in  the  contact  data.  The 
next  section  describes  the  calculations  necessary  to  marry  the  parametric  target 
motion  model  w'ith  contact  data  so  as  to  obtain  a Bayesian  estimate  for  the  track 
of  the  specified  target.  This  estimate  consists  of  posterior  scenario  weights,  a 
mean  target  track  for  each  scenario,  and  associated  covariance  matrices. 

The  calculations  necessary  to  perfoim  the  Bayesian  updating  for  the  distribution 
^ on  parameters  of  the  target  motion  model  so  as  to  account  for  contact  information 

^ on  a given  target  are  described  in  the  last  two  sections  of  this  chapter.  The 

^ calculations  here  are  strongly  motivated  by  those  given  in  the  third  and  fourth 

! sections. 


A Parametric  Model  for  Target  Motion 

In  this  section  we  describe  a parametric  model  for  target  motion.  This 
model  is  based  on  the  notion  of  target  scenarios.  We  postulate  the  existence  of 
a finite  number  of  such  scenarios,  or  operating  plans,  which  a target  might  follow 
during  a specified  phase  of  its  mission.  Each  scenario  may  be  thought  of  as 
corresponding  to  a basic  geometric  pattern  of  target  motion.  We  assume  that 
each  target  chooses  one  of  these  basic  patterns  and  follows  it  throughout  the 
specified  mission  phase. 

Each  scenario  is  determined  by  a mean  target  track  and  corresponding 
covariance  matrix.  Once  a scenario  has  been  chosen  for  a given  target,  the 
target  must  move  roughly  according  to  the  mean  track  of  the  scenario.  The 
target,  however,  is  permitted  to  operate  with  some  deviation  from  the  mean  track. 
For  example,  the  target  may  move  faster  or  more  slowly  than  the  specified  mean 
track,  or  it  may  vary  its  course  along  the  mean  track.  The  extent  of  these 
perturbations  in  target  motion  is  determined  by  the  covariance  matrix  associated 
with  the  scenario. 

An  example  cf  some  basic  tracks  which  might  be  used  to  define  a scenario 
is  given  in  Figure  III-l.  Additionally,  a number  of  target  tracks  drawn  from  the 
target  scenarios  are  indicated  in  Figure  III-l. 


EXAMPLE  OF  TARGET  SCENARIOS  AND  SAMPLE  TARGET  TRACKS 


Target  Scenario  3 


In  order  to  formulate  precisely  our  parametric  model  for  target  motion, 
suppose  that  there  are  J possible  target  scenarios:  S2,...,Sj.  Let^  be 

the  random  variable  with  range  in  the  set  { 1,  2, . . . , j}  which  specifies  the 
scenario  a target  will  follow.  Thus  the  event  ( ^ ^ j}  represents  all  target 
tracks  which  follow  scenario  Sj.  We  let  pj  be  the  probability  that  a target  will 


follow  scenario  S; 


Pr  j)  = Pj" 


We  now  suppose  that  a target  track  is  specified  by  the  position  of  the  target 
at  times  tj  < t2  < . . . < t,-.  Thus  each  target  track  is  a sequence  Z = (Zj^, . . . ,Zj) 
of  target  positions  z c IR^.  Let  Z be  the  2r  dimensional  random  vector  which 
specifies  a target's  track.  The  conditional  distribution  function  of  Z,  given  that 
it  represents  the  track  of  a target  operating  according  to  scenario  Sj,  is  assumed 
to  be  a multivariate  normal  distribution  with  mean  pj  and  covariance  matrix  Sj”^, 

In  order  to  simplify  the  following  discussion,  we  will  use  the  notation 
f(*  \ij  , 2)  to  indicate  a normal  density  function  with  mean  ju  and  covariance 
matrix  2“^,  The  matrix  2 is  called  the  precision  matrix  of  the  distribution. 

In  this  notation  we  have,  for  a measurable  subset  U of  IR^''", 


Pr{Zeul^=  j}  =/f(z|^.,  2.)dz. 


The  Parametric  Target  Motion  Model  in  the  Face  of  Uncertaint 


We  now  assume  that  the  parameters  in  the  target  motion  model  described 
above  are  uncertain,  but  that  we  have  prior  knowledge  about  their  possible  values 
which  we  will  quantify  as  probability  distributions.  In  particular,  we  assume 
imprecise  knowledge  about; 

i)  the  vector  of  probabilities  p = (p^,  P2 Pj)  which  describes 

the  distribution  of  the  random  variable 

ii)  the  mean  target  paths  fi  , fji  , ,fx  , and 

1 ^ J 

iii)  the  covariance  matrices  2 2 ^, . . . , 2 

1 ^ tj 

\\  '■  do  assume,  however,  that  the  number  of  possible  target  scenarios,  J,  has 
ah  jady  been  established.  Our  goal  now  is  to  use  contact  data  on  a number  of 
different  targets  to  Improve  our  knowledge  of  these  parameters. 
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We  now  replace  each  of  the  above  parameters  with  a random  variable.  Let 
Q be  a random  vector  with  values  in  tl.o  7-fold  Cartesian  product  [0,  l]x-  • -xfO,  1], 
such  that  the  coordinates  of  Q sum  to  1.  Let  , Lj  be  random  vectors  with 

values  in  and  let  , Aj  be  random  matrices  with  values  in  the  cone  of 

positive  definite,  symmetric  2 t x 2 t dimensional  real-valued  matrices.  We  will 
use  Q to  represent  our  state  of  Imowledge  concerning  the  vector  of  probabilities 
p;  r^, ....  Tj  to  represent  our  state  of  knowledge  concerning  the  mean  target 
tracks  . . . , /ij;  and  Aj,  . . . , Aj  to  represent  our  state  of  knowledge  concerning 
the  precision  matrices  Si,  . . . , 2j.  Note  that,  although  our  primary  interest  is 
in  obtaining  estimates  for  the  covariance  matrices  SJ^, . . . , Sj^,  for  technical 
reasons  we  use  the  random  variables  Ai,  . . . , Aj  to  represent  our  state  of 
knowledge  about  the  precision  matrices  Si, . . . Sj. 

Now  let  pr  be  the  prior  joint  probability  density  function  for  the  random 
quantities  Q,  . . . , Fj.and  A^^,  . . . , Aj.  The  prior  density  function  pr  must  be 
chosen  with  considerable  care.  Indeed  the  ease  with  which  we  will  ultimately 
obtain  estimates  for  p,  . . . , ^j,and  Si, . . , , S j depends  strongly  on  the  form 
of  the  density  function  pr.  In  particular,  it  is  important  that  the  posterior  distribution 
for  Q,  , Fj,  and  Ai, . . . , A j,  given  a certain  collection  of  contact  data, 

be  of  the  same  form  as  the  prior  density  pr.  Finally,  to  facilitate  computations, 
it  is  necessary  that  the  density  function  pr  have  a simple  form  which  can  be 
completely  described  with  as  few  parameters  as  possible. 

We  have  yet  to  find  a density  function  pr  which  completely  satisfies  all  of 
the  above  described  constraints.  It  is  possible,  however,  to  formulate  a prior 
density  function  pr  which  fulfills  our  demands  reasonably  well. 

We  will  assume  that  the  parameters  which  describe  a specified  scenario 
are  independent  from  one  another  as  well  as  independent  from  the  scenario 
weights.  If  vve  then  let  prQ  be  the  marginal  density  function  for  Q,  and  prj, 
j 1,  2,  . , . , J,  be  the  joint  marginal  density  function  for  and  Aj, 
by  our  independence  assumption  we  can  write  pr  as  the  product 


pr(q,  , Tj, 


J 

.aj)-pr^j(q)  ^n^pr.  (y.. 


a.). 


(III-l) 


We  now  assume  that  prQ,  the  density  function  for  Q,  is  a Dirchlet  distribution 
with  parametric  vector  a (aj, , . . , aj),  aj  > 0,  j --  1,  2,  . . . , J.  Using  the 
notation  g(.  [ a)  to  indicate  such  a aensity  function.  g(q|  a)  then  has  the  form 


F(a^  . . fttj) 

F(a^)  ...  r(aj) 


a -1  a_-l 

q 1 q,  2 . 


a -1 

qjJ  . 


with  0 < q.  < 1, 


1 < i < J. 


We  now  describe  the  density  functions  prj,  j 1,2,  ... , J.  We  assume  that 
the  conditional  distribution  of  Ij,  given  that  Aj  - a,  is  a multivariate  normal 


i 

i 

■.j 

i 
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distribution  with  mean  vector  y . t and  covariance  matrix  (v^  a)  ^ where  v-  > 0. 
Additionally  we  assume  that  the'^marginal  distribution  of  A.  is  a Wishart  distriiution 
with  mj  degrees  of  freedom,  m.  > 2t-1,  and  precision  matrix  V-,  where  V-  is  a 
2t  X 2t  positive  definite  symmetric  matrix.  The  Wishart  distribution  is  extensively 
discussed  in  references  [f]  and  [g]. 


We  will  use  the  notation  h(-  [ m,  V)  to  represent  the  density  function  of  a Wishart 
distribution  with  m degrees  of  freedom  and  precision  matrix  V.  In  general  then, 
for  some  appropriately  chosen  constant  c,  we  have,  for  all  2t  x2t  positive  definite 
symmetric  matrices  a, 

h(a|  m,  V)  c [ v|  a|  _itr(Va)}  , 

where  | v|  ^ determinant  of  V.  Thus,  the  joint  density  function  for  F;  and  A-  has  the 
form  ^ ^ 


pr.fy.a)  = f(7  ly  v a)h(a|m  , V.), 
J J J J 3 


j = 1,2, 


We  now  indicate  the  significance  of  the  parameters  which  define  the  prior 
joint  density  pr.  Note  that  the  prior  choice  for  the  values  of  these  parameters 
must  be  based  on  past  experience  or  operational  and  geographic  constraints. 

The  choice  of  the  prior  parametric  vector  a is  governed  by  our  prior 
estimates  on  the  relative  likelihoods  of  the  scenarios  Sj^, . . . , Sj.  Indeed  if 
P - 01+. . . +cvj  then 


E(Q)  = /qpr  (q)  dq  = o p~^, 


and  so  the  components  of  a are  precisely  our  prior  estimates  for  the  relative 
likelihoods  of  the  various  scenarios. 


In  order  to  obtain  the  proper  prior  values  for  the  parameters  yj  and  Vj, 
wc  consider  the  prior  conditional  density,  pr,  on  target  tracks  Z given  that  the 
are  samples  from  scenario  Sj.  Observe  that 


j)  --  / / pr  (7\y-  J,  Fj  - y,  A.  a)  pr.  (y,  a)  dy  d! 


/ f(z  I y,  a)  f (y.,  (v^  a) ) h (a  I m^,  dy  da 


V. 

/ a)  h (a I m.,  V.)  da. 


(111-2) 


I 

I 


i 


I 
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To  evaluate  this  last  integral  we  need  to  define  the  t distribution  with  m > 0 
degrees  of  freedom,  location  vector  a e IR^'^.and  positive  definite  symmetric 
2t  X 2t  precision  matrix  T,  t(x \ m,  a,  T).  This  density  is  defined  for  all  xeIR 
by  the  formula 


t(x  I m,  CT,  T)  = c 1 T 


where  c is  a constant  chosen  so  that  the  density  integrates  to  1,  Observe  that, 
for  all  sufficiently  large  m,  the  t distribution  is  approximately  a normal  distribution; 


t(x  m,  CT,  T)  f(x  CT,  T). 


It  is  now'  an  easy  matter  to  evaluate  the  integral  on  the  right  hand  side  of 
equation  (III-2)  to  obtain 


pr 


(zl^=  3)  = t(zl  m.-2T  + l, 


V 

— L (m.-2T  + l)  V.”S 
H-Vj  j 3 


\ . 


1 +v. 
3 


(m.  -2t  + 1)  V.  ^), 
J 3 


(III-3) 


where  the  last  approximation  is  valid  provided  m^^  is  sufficiently  large.  Thus, 
our  prior  estimate  for  the  probability  distribution  on  target  tracks  which  are 
associated  with  scenario  Sj  is  approximately  a multivariate  normal  distribution 
with  mean  track  y.  and  covariance  matrix 


V.  = — i-v  (m.  - 2t  + 1)  V."^. 
3 '"j  + 1 3 3 


In  the  theory  of  Bayesian  statistical  analysis  the  Bayes  estimate  for  a parameter 
is  the  expectation  of  the  random  variable  representing  that  parameter.  Thus,  the 
prior  Bayes  estimate  for  the  vector  of  the  probabilities  p is 

E(Q)  o p \ P = cVj, 

and  the  prior  Bayes  estimate  for  the  mean  target  track  pj,  of  scenario  Sj  is 
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j - 1.  2 J. 


E (r^) 


It  is  also  possible  to  show  that  the  prior  Bayes  estimates  for  the  covariance 
matrix  and  precision  matrix  Zj  are  respectively 


E(A.-S 


1 

m.  -2t  + 1 
] 


V., 

J 


V 

i_ 

V.  I 1 
,1 


and 


E(A.,  » 


m.  V.  . 
] J 


See  for  example  reference  [g  ]. 


Estimating  Target  Motion  Parameters  When  the  Track  and  Motion  Scenario  Are  Known 

In  this  section  we  indicate  how  to  perform  a Bayesian  update  on  the  probability 
distribution  for  the  parameters  of  our  target  motion  model  so  as  to  reflect  the 
information  provided  by  a completely  known  target  track.  We  thus  assume  that 
we  are  considering  a target 

1)  which  is  following  scenario  S^  (so  that  I ),  and 

2)  whose  track  is  z - (z  , z . . . z ). 

1 w T 

In  this  case  we  will  be  able  to  compute  precisely  the  posterior  distribution  for 
the  parameters  of  our  target  motion  model.  In  the  other  cases  which  we  will 
consider,  it  will  be  necessary  to  make  a number  of  approximations  to  perform 
the  updating. 

Let 

J 

Pi"  y, Tt’  Pt',  ^ ' 

1 d 1 J 0 ) 1 ] 1 I 

be  the  joint  prior  probability  distribution  for  the  iTindoir,  : . in;  v 
and  A J,  ....  A ^ respectively.  We  are  assuming,  i.  m • 


prQ(q)  = g(q|  a),  and 


pr^(Y,  a)  = f(y  a)h(al  m^,  ). 


We  seek  the  joint  posterior  density 


P^(Qf  » Vji  • • • $ I ^ ) 

pr  (z  I q,  . . . , Yj,  a^, , . . , a.^,^=  I )Pr  Hq,  Yj^ ^1 

pr{z\^=l)  Pr  {J(=l) 

J 

xpro(q)  jfl^pr.CYj.aj). 

In  order  to  evaluate  this  quantity,  recall  from  the  previous  section  that 
pr(z|q,  Yj^.....rj,  0 = f(zlYpaj). 

^ I qi  y^$  • • • » Yj»  • • • * ~ ’ 

Pr(z|^=  1)  « f(zlY^  , ),  and 

Pr^^  Z}=  or^ /P,  P = a^+.  ..+aj. 

Substituting  these  quantities  into  equation  (ni-4)  we  obtain 


ri Yj.  a^, . . . , aj  I z,^=Z)  oc  ^ a~^q^  f(z  fy^  a^)  prjj(q)  ^liPy Yj^, 


K 


Observe  now  that 


(in-4) 


(m-5) 


Pttj^q^  pr^(q)  = Pa^^  q^  g(qlQf)  = g(ql  a*) 

where  a'*  = (a;j,...,a'^+l, ,aj).  Thus,  the  posterior  distribution  for  the  random 
variable  Q is  a Dirichlet  distribution  which  depends  only  on  the  scenario  followed 
by  the  target  in  question. 
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We  now  consider  the  product 


f(z|y,  a)  pr^(y,  a)  = l{z\y,  a)  f (y  ly^,  a)  h (a|  m^,  ) 

V. 

= f(rlY^*,  (v^  +l)a)f(z|y^,  a)  h (a|m^,V^ ), 


where 


y\  = (v^  % +z)  (v^  +l)"^ 


Using  the  Identity 


(ni-6) 


^ (z4, )'  a (a-?, ) = rnr 


we  can  write  the  second  normal  density  which  occurs  on  the  right  hand  side  of 
equation  (in-6)  as 


<2,r^-JL.a 

Itv,  I 


A A 


i' 


Combining  this  with  the  expression  for  the  Wishart  density  function  which  appears 
in  (in-6),  we  thus  obtain 

f(z|y,a)pr2  (y,a)  oc  f(y  1^*,  (v^ +l)a)  h (alm^ +1,  Vp, 


where 
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.1 


1 


We  conclude  finally  that  the  posterior  joint  distribution  for  the  random 

variables  Q,  n Tr.  Ai, . , , , Aj^,  given  that  the  target's  track  is 

z = (z z ) and  that  it  is  following  scenario  S , is 

It  <• 


pr(q,  , y , a^  i = 1) 


= g(ql«*)  V ^ “j*’ 


(in-7) 


where 


O'*  = Q!  + (0,  0, . , . , 1, . . .0), 


A*  _ 1 

) 1 A 

y, +z) 

\ V,  + 1 Z Z 

If 


(1  appears  in  the  Z component) 


if  j / Z 


m^+1 


If  3=^ 


if  j/  Z 


if  j = Z , 


if  j Z 


if  j = Z, 


if  j Z 


In  particular  observe,  in  this  case,  that  the  posterior  distribution  for  the  target 
motion  parameters  has  precisely  the  same  form  as  does  the  prior. 


Est<Tnat.inpr  Target  Motion  Parameters  When  the  Track  Is  Known  but  the  Motion 
Scenario  Is  Unknown 

Suppose  that  we  consider  a target  whose  track  z = (zj,  Z2, . . . , *7-)  is 
completely  known,  but  we  have  yet  to  determine  which  of  the  motion  scenarios 
Si,  S2, . . . , Sj  the  target  is  following.  In  this  case  we  would  like  to  estimate  the 
joint  posterior  distribution  for  the  target  motion  parameters  given  the  target 
track  z.  Observe  that  we  can  write 


pr(q,  . Tj.  a^, . , . , a ^ | z) 

= ^2  pr(q,  Yj.  a^^, . . . , a^  | z,^  I ) Pr  l\z}  . 


(III-8) 


The  first  factor  of  each  term  appearing  on  the  right  hand  side  of  equation  (III-8)  has 
already  been  determined  in  equation  (III-7).  We  thus  need  only  evaluate  the 
conditional  probability  that  the  target  is  following  scenario  S^  given  that  the  target's 
track  is  z.  To  do  this  we  employ  Bayes  theorem  to  write 

, pr(z|^=l ) Pr(^=l} 

VT{j^=l  Iz}  = j 

2^  pr(z|^=  j)  Pr{^=  3} 


Using  now  our  parametric  model  for  target  motion,  we  can  evaluate  the  quantities 
which  appear  on  the  right  hand  side  of  this  identity.  We  thus  obtain 


Pr{^=Z  I z)  = f(zly^,  V^)  [ 


f(z  I 


'^k*  'k 


A _1 

V.)] 


(III-9) 


Combining  equations  (III-7),  (III-8),  and  (ni-9),  we  are  able  to  compute 

the  joint  posterior  distribution  for  our  target  motion  parameters  given  only  the 
target  track  z.  Observe,  however,  that  this  posterior  has  a different  form  than 
the  prior.  Additionally,  this  posterior  involves  many  more  parameters  than  does 
the  prior,  and  this  unfortunate  state  of  affairs  can  rapidly  make  the  necessary 
calculations  impractical. 

The  cause  of  this  problem,  of  course,  is  the  impossibility  of  assigning, 
without  additional  data,  a given  track  to  a unique  scenario.  One  possible  solution 
would  be  to  use  equation  (III-9)  to  determine  the  highest  probability  scenario, 
and  then  assign  the  target  track  z to  that  scenario.  The  calculation  of  the 
posterior  joint  distribution  for  the  target  motion  parameters  would  then  be  carried  out 
as  in  the  previous  section.  Such  a scheme  is  particularly  meaningful  when  the 


-80- 


Although  we  are  unable  to  compute  tiie  posterior  joint  distribution 
pi'(Q>  Yi* . . ■ > Vjt  • I I z)  without  either  an  increase  in  the  number  of 
parameters  or  some  maximum  likelihood  estimating  scheme,  nevertheless 
we  can  give  a useful  approximation  for  the  marginal  posterior  distributions 
prQ(q|  z).  Suppose  that  pr^(q)  = g(qlQ;).  We  then  have 


prQ(ql  z)  = pr^(q|  z,^=  j)  Pr  z} 

= g(q  I Pr  (5^=  j I z} , 


(HI- 10) 


where 


Of 


a+(0,0,  ...,0) 


(1  appears  in  the 
coordinate). 


Observe,  however,  that  if  we  set 

a*  = Pr  {^=j|  z}, 

then  the  Dlrchlet  distribution  g('  |a*)  has  the  same  mean  as  does  the  sum  of 
Dirchlet  distributions  (III- 10).  Thus,  in  manv  applications,  it  may  be  reasonable 

to  approximate  the  posterior  distribution  prQ(q|  z ) with  the  Dirchlet  distribution 
g(q  1 O'*).  This  approximation  of  course  prevents  the  increase  in  the  number  of 
parameters  that  occur  in  equation  (in- 10). 


Target  Contacts 

It  seldom  happens  that  the  complete  track  of  a target  is  available  for  analysis. 
Indeed  it  may  not  even  be  known  which  of  the  various  scenarios  Sj, . . . , Sj  a given 
target  is  following.  Rather,  it  is  much  more  common  to  have  available  a collection 
of  target  contact  data  on  which  all  conclusions  about  target  motion  must  be  based. 
The  purpose  of  this  section  is  to  describe  our  model  for  target  contacts. 


2t 

Suppose  that  the  track  of  a specified  target  is  z = (zj^, . . . , z^)  e IR  , and  let 

D4.1.  be  the  kfh  contact  event  on  the  tararet  at  time  t Fach  contact  event  cnnelnta  of 


an  estimated  position  for  the  target  at  time  t and  a known  covariance  matrix 
We  assume  that  the  estimated  position  for  the  target,  given  that  the  contact  occurred, 
is  a random  vector  which  depends  only  on  the  actual  position  of  the  target  at  time 
t.  Moreover,  the  distribution  of  the  random  vector  d^j^is  a bivariate  normal 
distribution  with  mean  and  precision  matrix  Additionally,  we  assume  that 
the  random  vectors  dtk  are  all  mutually  independent.  Thus  the  joint  probability 
distribution  of  the  random  vectors  d^i^ given  that  Z = (z  , z , . . . , z ) is 

X A T 


Here,  for  zc  IR  and  A a 2 x 2 positive  definite  matrix,  we  use  f(- 1 z.  A)  to  represent 
the  bivariate  normal  density  function. 


Estimating  the  Track  of  a Target  from  Contact  Data 

Suppose  that  we  are  interested  in  obtaining  an  estimate  for  the  track  of  a 
specified  target  from  contact  data.  Recall  that  the  track  of  this  target  is  a 
random  vector  Z with  values  in  1R2t  our  prior  estimate  for  the  distribution 
of  Z depends  on  the  parameters  which  describe  the  uncertainty  in  our  model 
for  target  motion.  Thus  letQ,ri,  • • • * A j, . . . , Aj  are  the  random  quantities 

which  represent  our  state  of  knowledge  about  the  vector  of  scenario  weights 
P = (Pi»  • • • . Pj)i  the  mean  target  paths  . . . , ^j,  and  the  scenario  precision 
matrices  2^, . . . , 2j  respectively.  We  assume  that  the  joint  distribution  for 
these  quantities  has  the  density  function  pr  given  by  equation  (III-l)  where 
priq)  = g(q Ia)  and,  for  j = 1,2, . . . , J, 


prjfy.a)  = f(r Itj,  (Vj  a))  h (aim.,  V^). 


It  follows  from  equations  (111-2)  and  (III-3)  that  our  prior  estimate  for  the 
distribution  for  the  track  Z is  given  by  the  density  function  pr: 


Cl 

Pr(z)=jfiPr  (z|^=j)  Pr{^=j} 
* jJl  Vj)«j 


where  P = o , ^. . . + a and 
1 J 


Suppose  now  that  we  have  obtained  contact  events  on  the  target.  We 
thus  have  the  position  estimates  = I tk>  corresponding  precision  matrices 
Atk,  at  various  times  t,  1<  t<  r.  In  order  to  simplify  our  notation,  let  D be  the 
joint  contact  event 


^ ^^tk  ^tk’ 


We  seek  then  the  posterior  distribution 

J 


pr  (z|d)  = pr(D)  ^ .fj^pr(D  [ z,^=  j)  pr  (z|jr=J)  Pr{^=j}.  (in- 11) 


Since  the  distribution  of  d^k  depends  only  on  the  actual  position  of  the  target 
at  time  t,  and  not  on  the  target  scenario,  we  have 


pr  (dI  z,/=j)  = prpl  z)  = Zj^,  A^j^). 


For  each  time  t at  which  we  have  at  least  one  contact  on  the  target,  we  can  write, 
by  the  process  of  completing  the  square,  described  in  Appendix  A, 


I « tk- V < W = < VV  ^ ' VV " H 


where  is  a constant  independent  of  z = (zj, . , . , z^),  Aj  is  the  2x2  positive 
definite  symmetric  matrix 
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Note  that  these  calculations  require  nothing  more  difficult  than  inverting  a 2 x 2 
matrix.  Also  if  there  are  no  contact?  at  time  t,  let  be  the  2x2  zero  matrix, 
= (0. 0),  and  = 0. 

Now  let  A be  the  2t  x 2t  block  diagonal  matrix 


A = dlag  (A^.A^, ....  A^), 


let  uelR 


2t 


be  the  vector 


= a*.. 


.<y'y  . 

T 


kA 


and  let  L = L,  + L , We  can  then  write 

1 T 


pr(D  I z,^=  i ) = C ^ 


(III- 12) 


where  C Is  the  constant  (2v-)  ^ e \ and  ju  is  the  total  number  of  contacts.  Observe 


that  C is  independent  of  the  target  track  z as  well  as  the  scenario  followed  by  the 
target. 


At  this  point  it  is  convenient  to  compute  the  probability  of  obtaining  the 
collection  of  contact  data  D given  that  the  target  is  following  scenario  Sj. 
Indeed,  from  equation  (III- 12)  we  have 


pr(Dl^=j)  = / pr(D|  z,^=j)  pr(z|^=  j)dz 


A j A A _i  A 

- r p"2(o--y.)  V (V  + A)  A(a-7  ) 
~ 113  1 . 


(Ill- 13) 


where 


/ivj  |v,+Ai"\n  I A 


1 1 


tk  ’ tk' 


Also,  since  the  prior  probability  that  the  target  will  follow  scenario  Si  is 
=■  (P  = 01+. . . +0^),  we  have 

J 

pr(D)  - p~^  .2^  pr(D|^=j) 
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Consider  now  the  product 


pr(D  I z,^=  j)  pr(z  \^=  j)  = C a/h  ( ( e *^^£(2!%.^). 

tK  J J 

In  order  to  compute  this  density  note  that  again  by  completing  the  square  of  a 
quadratic  form,  we  can  write 

A 

(Z-V^)»  Vj(Z-V^)  + (Z-Or)'A(Z'CT) 

= (z-a^)'  (z-6^)  + A(a-^.), 

where 

A _i  ^ A 

'en  have 


I 


I 

L 


pr(D|  z,^=j)  pr  (zl^=j) 


AAA 


(in- 14) 


-1 


where  Cj  Is  the  constant  defined  above. 


Finally  we  can  compute  the  posterior  distribution  on  target  tracks  given 
the  collection  of  contact  data  D.  Indeed  combining  equations  (HI- 11)  and 
(in- 14)  we  obtain 


pr(z  I D)  = pr(D)‘ ^ |^f(z  1 6^,  U^)  e "'‘'j' ' V.  (V^  + A)  A (0-7^)  ^ 


nil- 15) 


Estimating  Target  Motion  Parameters  from  Contact  Data  When  the  Motion  Scenario 
Is  Known 

Let  the  prior  probability  density  function  for  the  random  quantities  Q, 
r^,  • • • I Tjt  A ^ j given,  as  before,  by 


where 


pr(q,  Y, .....  Vt.  a . , a ) = pr  (q)  n pr  (7  . a ) , 
1 J 1 J 0 ]=1  3 ] J 


pr^Cq)  = g(qlQ!).  and 

pr.(Y,a)  = f(YlY..  v.  a ) h (a|m.,  V^),  3 = 1.2 J. 

Consider  a target  which  is  known  to  be  following  scenario  S;  (so  that^=  1). 
Suppose,  however,  that  the  target  track  z = (zj, . . . , z,.)  is  unknown  but  tliat  we 
have  obtained  the  contact  events  at  various  times  t,  1 < t<  t.  Each  contact 
event  consists  of  a mean  target  position  estimate  d^j^  = Itk  together  with  a 
corresponding  covariance  matrix  Let  D be  the  joint  detection  event 


allt,  J). 

We  seek  the  joint  posterior  distribution  for  the  target  motion  parameters  conditioned 
on  the  joint  contact  event  D : 


p^^(q.  • • • . Yj>  ^ • • • . I ii»  ^ ^ ) 

pr(D  I q, Yj^)  ....  Yj>  . • • . ^ ^ ^ ^ • • • > Yj.  • • • > } (III-16) 

pr(D|^=Z)  pr  {j(=l) 

J 

•pr  (q)  n pr  (Y..a  ). 
w ]=1  3 3 3 


We  have  already  established  the  values  of  two  of  the  factors  which  appear  on  the 
right  side  of  (111-16).  Indeed, 

1*^  ^ I *3’  1^2’  • • • » • • • > ~ * and 

Pr  {^=  0 = P ^ (P  = 0!^+. . . +Q^). 


Also,  since  each  estimated  position  for  the  target  is  a random  vector  which 
depends  only  on  the  actual  position  zt  of  the  target  at  time  t,  the  likelihood  function 
pr(D  I q,  Yif  • • • . Vj.  a , . . . , a i ) is  independent  of  the  parameters  y and  a 
for  \ ^ I . We  thus  c^  write^  ^ ^ 


pr(D  1 q,  y^,  a^, , . . , aj,^^  1)  pr(D  I Yj » a^  ’ ^ 


Substituting  these  quantities  into  equation  (III- 16)  we  obtain 


pr(q»  Yj^i  • • • I Yj»  a^^, . . . , 3.J  1 D,  1) 


prply^,  «^^Qpro(q)  " pr.(Y..  a.) 


(111-17) 


But  observe  that 


prQ(q)  = P g(q 


), 


where  a*  = (a^, , . . , + 1, . . . .oj).  Thus  the  posterior  distribution  for  the 

random  variable  Q is  a Dlrchlet  distribution  which  depends  only  on  the  scenario 
followed  by  the  target  in  question. 

In  order  to  complete  the  evaluation  of^III- 17)  we  need  to  compute  the 
likelihood  function  pr(Dly^,  a;  - 1).  Since  the  distribution  of  each  d^j^  depends 
only  on  the  actual  position  of  the  target  at  time  t,  we  have 


prpjy^,  a^,  0 pr(Di  z)  pr  (zly^,  a^,^-  i)  dz 


=J'pr(D\z)  f (zIy^.  a^)  dz. 


(ni-18) 


Recall  now  from  equation  (111-12)  that  we  can  find  a constant  L,  a vector 
crelR^'''  and  a 2t  x 2t  block  diagonal  positive  semideflnlte  matrix  A such  that 


pr(D|z)  = c/n  ^ 


t,  k 


L„ 
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where  C is  the  constant  (2-tr)  ^e  , and  n is  total  number  of  contacts.  By  the 
process  of  completii^  the  square  described  in  Appendix  A,  we  can  find  a vector 
ye  such  that 


(z-ff)'A(^-CT)  + (7-y^)a.^ 


-1 


- (z-y)’  (A+a^)  (z-y)  + (a-r^)'A(a^  +A) 


Thus,  evaluating  the  Integral  in  Equation  (III- 1- ' we  obtain 


pr(D|v^,  a^,  - Z)  = C V UJ  U^+aI  e A(ct-v^). 


In  order  to  complete  our  evaluation  of  the  posterior  - .ensity  given  in  (III-17), 
we  must  consider  the  product 
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I -1  “1 

U=-^-T  (a,+A)  A,  and 

v^+l  M ' 


* V, 


a^  + A I Act} 


Thus  for  an  appropriately  chosen  constant  , we  can  write 


prply^,  a^.  /=  0 pr^  (T^.  


C/fcy^iV* 


|a^+(l+l/v^)  A 


h(a^  |m^,V^  +V{(T-y^)  (ct- 


^ In  order  to  continue  our  calculations,  it  is  necessary  at  this  point  to  make 

1 a number  of  important  approximations.  First,  note  that  for  each  positive  definite 

C matrix  a,  we  can  find  a number  6 (a.,)  such  that 

f i l 

( 

^ V 

[ tr  a^  = Il-e(a^)  ] tr  A. 

i ^ 

t 

i 

! Observe  that  If  V-  is  sufficiently  large  we  then  have 

[ ^ 

f V 

I T = (a^+A)"^(-J^a^+A)«  e(a^)I. 

; I 

I Here  I is  the  2t  x 2t  identity  matrix.  It  is  easy  to  see  that  Vjr  (v;+  l)~^<e(a^)<  1. 

I The  quantity  6(^i)  is  important  in  that  it  provides  a measure  for  the  amount  of 

1 target  localization  information  provided  by  the  contact  data  D relative  to  that 

[ provided  by  the  precision  matrix  a/ . Indeed  if  no  contacts  were  obtained, so 

that  A = 0,  then  ^(a^)  = v^  (v^  +1)"^.  Similarly,  if  we  obtained  a contact  at  each 
- time  t,  t = 1,  2,  , . . , t,  then  0(a^)— 1 as  the  precision  of  the  target  contact  data 

[ converges  to  the  target's  actual  track, 

b 

Y 

I Using  now  this  approximation  we  can  write 


T«e(a)I, 

i 


1 


4 


r 


u 


I 1 / .v-l  * J 

7 -TT, — r (^I  +A)  A,  and 

V +1  fl(a  ) ' / ' ’ 


V r ~ 

ihn  * 

t L f _i  -J 


Next  note  that,  for  each  positive  definite  matrix  a^ , there  is  a number 


(p{3-j^)  such  that 


+A(1  +^)  1 


\n^i) 


Since  1 a^  +A(l+\^)  ( >11^  jIajj  ( , it  follows  that  0<<;p(a^  )<  1.  The  quantity  <p(aj) 
is  a second  measure  for  t\ie  amoxmt  of  target  localization  Information  provided 
by  the  contact  data  D relative  to  that  provided  by  the  precision  matrix  a^.  Indeed, 
if  there  is  no  contact  data  then  <;«’(a^)  0.  Also  if  a contact  was  obtained  at  each 

time  t,  t = 1,  2,  , . . , r,  then  <;0(aj)— 1 as  the  precision  of  the  target  contact  data 
converges  to  the  target's  actual  track. 

In  order  to  obtain  computationally  useful  results  it  is  necessary  to  obtain 
estimates  for  the  quantities  T,  U,  0,  and  cp  which  do  not  depend  on  the  precision 
matrix  aj.  To  obtai^  such  estimates,  we  will  replace  aj  with  a certain  expected 
value.  Note  that  A;  is  a random  matrix  which  represents  the  covariance  matrix 
. Thus,  the  quantity 

''i 


is  the  Bayesian  estimate  for  the  matrix  . We  will  approximate  the  matrix  a 


with  the  matrix 


'I 


V®  - E{A'^"^  = (m^  -2t  + 1)  v"\ 


Observe  that  = v^  (v^  1)  ^ V°, 
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i 


Now  set  and  (p^  = <p(V^ ).  We  then  have  the  approximations 


T « I. 


U 


y*  K [v^  +iv^  + A]  ^ A a],  and 


<Pl  (a^ ) « <P^ 


Substituting  these  approximations  into  equation  (111-17)  we  conclude  finally  that 

the  posterior  distribution  for  the  random  quantities  Q,  , Ij,  and  Aj^ A 

given  the  joint  contact  data  D on  a target  and  the  fact  that  the  tai^et  is  following 
scenario  , satisfies 


Pr(q, 


• > 


^1»  * • * ’ I ^ 

J 

g(ql  O'*)  V ^ 


(III- 20) 


where 


a*  = a ^ (0, 0, . . , , 1, , . . , 0),  (1  appears  in  the  I component), 

y.  4 I 

r.*  = ^ 

' j 

^i^yi  ^A]"^  Aa}  if  j = i , 

X 


V, 

J 

if  j 1 

(v^+i)e^ 

If  j = ^ 

m. 

if  j 

1 

m,  +<p, 

if  j = Z , and 

Estimating  Target  Motion  Parameters  from  Contact  Data 

Suppose  now  that  we  have  obtained  the  joint  contact  event  D on  a given  target, 
but  we  have  yet  to  determine  which  of  the  motion  scenarios  Sj^,  S2, . . . , Sj  the  target 
is  following.  This  is  the  most  likely  form  that  the  available  data  will  take.  We 
would  like  to  estimate  the  joint  posterior  distribution  for  the  target  motion 
parameters  from  the  contact  data  D.  Observe  that  we  can  write 

pr(q,  Yj, . . . , 7j.  a^, . , , , a^  1 D) 

= 2 Pr(q,'yj^,  . . .,Tj,  a^, , .,,aj  |d,  J = Z)  Pr  (^=  Z |d).  (III-21) 


The  first  factor  of  each  term  appearing  in  (III- 21)  has  already  been  evaluated 
in  equation  (111-20), 


To  complete  the  evaluation  of  (III- 21),  we  need  only  compute  the  conditional 
probability  that  the  target  is  following  scenario  given  the  contact  data  D.  To 
do  this  we  employ  Bayes  theorem  to  write 


Recall  now  that  Pr{^=  Z } = P t P - 01+.  • • Also  the  probability  of 
obtaining  the  contact  data  D given  the  target  is  following  scenario  S;  has  been 
evaluated  in  equation  (HI- 13).  Thus  combining  equations  (III-20),  (111-21), 

(in- 22),  and  (III- 13)  we  are  able  to  estimate  the  joint  posterior  distribution 
for  our  target  motion  parameters  given  only  the  target  contact  data  D,  Observe, 
however,  as  in  the  case  where  the  target  track  is  known  but  the  motion  scenario 
is  not,  this  posterior  has  a different  form  than  the  prior  and  Involves  many 
more  parameters  than  does  the  prior. 

The  cause  of  this  problem  is,  much  as  before,  the  impossibility  of  determining 
solely  from  contact  data  which  scenario  a target  is  following,  A possible  solution 
to  this  problem  would  be  to  use  equation  (III-22)  to  determine  the  highest  probability 
scenario,  and  then  assign  the  target  to  that  scenario.  The  calculation  of  the 
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posterior  joint  distribution  for  the  target  motion  parameters  would  then  be  carried 
out  as  in  the  previous  section.  Again,  such  a scheme  is  particularly  meaningful 
when  the  scenarios  are  widely  separated  in  space  and  the  contact  data  fit  one 
scenario  significantly  better  than  any  other  scenario. 


CH_^PTE_R_IV_ 

ASW  INFORMATION  PROCESSING  IN  A MULTI-TARGET  ENVIRONMENT 


In  this  chapter  we  develop  Bayesian  statistical  methods  for  processing  ASW 
Information  In  a multi-target  environment.  A potential  application  of  this  work 
Is  In  ASW  information  processing  systems  designed  to  accept  ASW  Information 
from  diverse  sources,  Including  direct  surveillance,  In  an  effort  to  Initiate  and 
maintain  simultaneous  localizations  on  a number  of  submarine  targets.  Such 
systems  can  be  used  both  to  carry  out  ASW  surveillance  and  to  assess  vulnerability 
to  such  surveillance  by  others. 

The  chapter  Is  divided  Into  a number  of  sections.  The  first  section  Is  an 
Introduction  and  provides  the  necessary  background  to  place  the  current  work  on 
Information  processing  In  perspective.  The  second  section  provides  a brief 
description  of  ASWIPS  (ASW  Information  Processing  System),  a small  scale 
system  developed  as  a testbed  for  Information  processing  methodology.  The  third 
section  Involves  a theoretical  discussion  of  Information  processing  algorithms. 

Finally,  the  fourth  section  presents  some  numerical  results  based  on  ASWIPS 
comparing  a number  of  alternative  Information  processing  algorithms.  Extended 
Memory  processing  is  described  In  detail  In  Appendix  B. 

Backgroimd  and  Introduction 

Suppose  that  a number  of  submarine  targets  are  known  to  be  operating  In  a 
more  or  less  well  defined  ocean  area  and  periodic  estimates  are  to  be  generated 
for  each  of  their  positions.  Potential  sources  of  Information  to  assist  in  target 
force  localization  Include:  water  depth  contours;  submarine  speed  limitations;  i 

direct  surveillance  Information  such  as  observations  of  port  arrivals  and  depar- 
tures and  ASW  sensor  detection  data;  geographic  constraints  Imposed  by  submarine 
missions;  and  so  on.  We  treat  this  localization  problem  in  terms  of  ASW  surveillance 
information  processing  systems,  1.  e. , systems  designed  to  accept  as  Input  Information 
of  the  type  described  and  produce  as  output  estimates  of  the  locations  of  the  target 
submarines. 

Our  principal  Interest  In  this  chapter  Is  In  Bayesian  statistical  methods  for  the 
systematic  generation  and  updating  of  target  location  predictions  In  a multi-target 
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environment.  By  a multi-target  environment  we  mean  one  in  which,  in  general, 
there  are  situations  in  which  it  is  not  possible  to  determine  the  target  that 
generated  a given  sensor  contact.  For  example,  it  may  be  that  the  submarine 
targets  are  acoustically  indistinguishable.  In  addition,  it  is  assumed  that 
individual  targets  are  not  confined  to  operate  in  disjoint  subregions.  Under 
these  restrictions  it  is  not  possible  to  treat  the  multi-target  surveillance  problem 
as  a composite  of  many  independent  single  target  problems  even  though  the 
underlying  target  motions  may  be  statistically  Independent,  It  is  the  information 
processing  dir  ulties  arising  from  this  intrinsic  multivariate  nature  of  the 
problem  tha  ' the  concern  of  this  chapter, 

A substantial  portion  of  the  discussion  below  will  deal  with  specific  multi- 
target Bayesian  information  processing  algorithms  and  their  comparative  evaluation, 
A particular  such  algorithm  called  Extended  Memory  processing  will  emerge  as 
the  only  one  of  the  Bayesian  approaches  we  consider  that  both  makes  accurate 
use  of  the  observational  data  and  is  computationally  practical. 

In  order  to  provide  the  Extended  Memory  methodology  with  a framework  within 
which  it  can  be  Implemented,  tested,  and  compared  with  other  processing  approaches, 
we  have  put  together  a small  scale  developmental  information  processing  system 
called  ASWIPS  (ASW  Information  Processing  System).  We  next  proceed  to  describe 
ASWIPS. 


Structure  of  ASWIPS  (ASW  Information  Processing 


ASWIPS  is  a small  scale  information  processing  system  of  computer  programs 
currently  implemented  on  a PRIME  400  minicomputer  designed  to  accept  as  input 
contact  data  on  a small  number  of  targets  and  using  Bayesian  statistical  methods 
to  generate  target  location  estimates.  The  principal  output  of  ASWIPS  at  the  end 
of  each  processing  update  stage  consists  of  probability  distributions  for  the 
location  of  each  active  target. 

Target  locations  are  assumed  to  be  discretized  into  1°  x 1°  cells  within  a 
10  X 10®  overall  planar  grid.  Target  locations  are  thus  specified  in  terms  of 
probability  distributions  on  this  100  cell  grid.  During  the  time  between  two 
successive  updates  a target  can  either  remain  in  its  initial  cell  or  move  to  one 
of  the  eight  adjacent  cells  (five  cells,  if  the  initial  cell  shares  an  edge  in  common 
with  the  region  boundary;  three  cells,  if  it  is  a comer  cell)  in  the  two-dimensional 
grid.  Each  of  these  possible  transitions  is  assumed  to  have  equal  probability. 

Thus  the  motion  model  has  a very  simple  Markovian  structure. 

Observational  information  is  input  to  ASWIPS  in  the  form  of  probability  dis- 
tributions on  (target-like)  objects  detected.  Thus,  the  raw  contact  data  are 
effectively  assumed  to  have  been  preprocessed  before  being  input  to  ASWIPS. 

Each  object  location  distribution  is  assumed  to  correspond  to  an  observation  on 
one  of  the  targets  in  the  operating  area. 
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The  targets  are  assumed  to  be  effectively  indistinguishable  in  that  sensor  contacts 
cannot  be  uniquely  associated  with  the  target  causing  them.  It  should  be  noted, 
however,  that  in  some  cases  such  an  identification  may  be  possible  by  statistical 
inference.  This  would  be  the  case  when  an  object  location  distribution  Is  out  of 
sensor  contact  range  of  all  but  one  of  the  prior  target  location  distributions. 

In  general,  however,  an  object  location  distribution  may  overlap  with  more  than 
one  prior  target  location  distribution  in  which  case  the  identity  of  the  target  that 
caused  the  underlying  responses  is  ambiguous.  In  such  a case  we  will  refer  to  the 
object  location  distribution  (and  the  underlying  response  pattern)  as  unresolved. 

The  updating  of  target  locations  distributions  is  performed  at  the  end  of  each 
update  stage.  First,  the  target  location  distributions  as  produced  by  ASWIPS 
at  the  end  of  the  preceding  processing  stage  are  updated  for  target  motion.  This 
is  done  by  applying  the  one-stage  Markov  transition  operator  for  the  motion  model 
that  we  described  earlier.  The  effect  of  this  motion  updating  is  to  make  the  target 
location  distributions  more  diffuse,  reflecting  the  fact  that  target  motion  since 
the  last  update  has  increased  the  degree  of  our  uncertainty  about  target  locations. 

The  second  phase  of  processing  at  the  end  of  each  update  stage  involves  revising 
the  target  location  distributions  for  the  contact  information  obtained  since  the  last 
update.  We  will  reserve  our  discussion  of  the  specifics  of  the  updating  algorithms 
used  for  the  next  section  of  this  chapter. 

It  should  be  observed  that  just  as  target  motion  tends  to  diffuse  the  target 
location  distributions,  contact  information  tends  to  concentrate  them.  It  Is  this 
continuing  tug  of  war  between  the  loss  of  information  resulting  from  target  motion 
and  the  gain  of  information  resulting  from  contact  information  that  contols  the 
dynamics  of  the  target  localization  capability  of  ASWIPS.  Factors  which  control 
the  direction  that  this  information  struggle  will  take  include  the  number  of  targets, 
target  speed,  the  size  of  the  operating  area,  contact  data  rate,  sensor  detection 
ranges,  etc.  and  of  equal  importance,  the  processing  algorithms  themselves. 

This  brings  us  to  the  topic  of  the  next  section. 


Discussion  of  Information  Processing  Algorithms 


(k) 

In  the  case  of  ASWIPS,  target  lo^tlons  are  discretized  into  cells  so  that  X is  the 
index  of  the  cell  containing  the  n target  at  stage  k.  Under  the  presumption  that 
these  target  locations  are  uncertain,  the  objective  of  Information  processing  is  to 
estimate  these  locations  in  terms  of  probability  distributions.  Thus  we  can  think  of 
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as  a random  N-vector  whose  probability  distribution  we  seek.  Specifically,  using 
all  available  information,  we  would  like  the  information  processor  to  estimate  the 
joint  frequency  function 

(Xj,  X2,  • . . . Xj^)  - Pr  = (x^,  Xg ^ > (IV-1) 

where  x e I for  n = 1, 2,  . . , , N with  I being  the  collection  of  all  cell  indices, 
n 

Before  pursuing  the  question  of  how  P'  ' is  to  be  estimated  we  consider 
the  question  of  how  many  values  It  takes  to  specify  this  location  frequency  function. 
Consider  a case  in  which  the  operating  area  grid  contains  approximately  1, 000 
1°  X 1°  cells  and  the  number  of  targets,  N equals  10,  The  number  of  possible 
states  of  the  system,  each  of  which  corresponds  to  a joint  location  of  the  10  targets, 
is  1, 000^®  = 10^®  which  Is  obviously  prohibitively  large  from  a computational 
standpoint. 

Suppose  now  that  it  can  be  assumed  that  the  N targets  are  operating  independently. 
If  we  define  the  marginal  target  location  distributions 

p2‘>(x)=Pr{X«.x), 


then,  assuming  Independence,  we  obtain  the  relationship, 

N 

p”  <’‘i’  *2 V = „Ji  *V- 

\ 

If  the  relationship  in  equation  (IV-2)  were  valid  and  remained  valid  throughout 
information  processli^,  the  numerical  size  of  the  problem  of  estimating  target 
location  distributions  would  be  sharply  reduced.  Going  back  to  our  example,  we 
observe  that  each  p^),  1 = 1, 2, , . . , 10  Involves  the  specification  of  up  to  1,  000 
values.  Thus  the  total  number  of  values  required  to  specify  all  target  location 
distributions  would  be  only  10, 000,  well  within  the  capability  of  modern  computers. 


A problem  unfortunately  arises  from  the  fact  that  even  if  the  relationship 
in  equation  (rV-2)  were  valid  at  the  outset  of  processing,  i.  e. , before  any  sensor 
contacts  were  processed,  and  even  if  the  individual  target  motions  were  statistically 
independent,  equation  (IV-2)  in  general  would  not  continue  to  hold  during  the  course 
of  processing.  A very  simple  example  serves  to  illustrate  this  fact. 


Suppose  we  consider  a case  in  which  the  operating  area  consists  of  only  two 
cells,  each  containing  a single  sensor  which  can  detect  a target  only  in  its  own 
cell.  Suppose  there  are  two  targets  operating  independently  each  of  which  is  a 
priori  initially  equally  likely  to  be  in  either  of  the  two  grid  cells.  By  assumption, 
then  equation  (rV-2)  holds  initially.  Now  suppose  that  during  the  first  update 
period  each  of  the  two  sensors  reports  a contact.  Designate  the  two  targets  as 
T and  T and  the  two  grid  cells  as  C and  C . Then  it  is  easily  seen  that  there 
are  only  iwo  possibilities;  target  is  in  ceU  and  target  is  in  cell  C^,  or 
vice  versa.  Based  on  our  observational  data  alone,  we  cannot  conclude  with  certainty 
which  of  these  two  equally  likely  possibilities  is  the  correct  one,  although  we  can 
with  certainty  eliminate  the  possibility  that  the  two  targets  are  in  the  same  cell. 

One  now  observes  an  interesting  phenomenon.  Whereas  before  the  contacts  were 
reported  the  target  locations  were  statistically  Independent,  after  the  contacts 
and  occur,  the  target  locations  are  completely  correlated.  Specifically, 
knowledge  of  the  location  of  one  target  implies  deterministic  knowledge  of  the 
location  of  the  other  target.  If  target  is  in  cell  C^,  then  necessarily  target 
T„  is  in  cell  — thus  the  complete  correlation  between  the  two  targets.  We 
will  refer  to  the  problem  of  properly  incorporating  this  contact-induced  correlation 
between  targets  into  Information  processing  as  the  target  coupling  problem. 

One  might  reasonably  argue  that  the  example  we  have  given  is  extreme  and 
that  in  more  realistic  cases  the  degree  of  correlation  among  target  location 
distributions  induced  by  the  sensor  data  is  likely  to  be  small.  Suppose  for  the 
time  being  we  assiune  that  the  target  location  distributions  can  in  fact  be  treated 
as  approximately  independent  and  we  proceed  naively  on  the  basis  of  this  assumption. 
This  means  that  we  will  deal  only  with  the  individual  marginal  distributions  p'  for 
target  location.  The  problem  then  becomes  one  of  devising  a procedure  for  updating 
these  marginals  to  reflect  sensor  data.  A standard  and  well-studied  procedure 
for  this  kind  of  updating  is  based  on  Bayes'  theorem  on  conditional  probabilities. 

In  the  next  two  subsections  below  we  describe  two  possible  target  location 
distribution  updating  schemes  based  on  the  independence  assumption  and  Bayes' 
theorem. 


Sequential  processing.  We  assume  that  the  sensor  contacts  reported  during 
an  update  stage  have  been  clustered  into  sensor  response  patterns  each  generated 
by  a single  target.  We  associate  with  each  response  pattern  the  presence  of  a 


(target-like)  object.  Under  sequential  processing  the  target  location  ^distributions 
are  collectively  updated  once  for  each  such  response  pattern.  Let  p^  ^(x)  can  be 
obtained  recursively  from  p^*^»  ^ “^)(x). 

Assume  L response  patterns  were  observed  during  the  k^^  stage  and  let 
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(x)  = Pr{Z observed  target-like  object  is  in  cell  x at  stage  k}, 

/ = 1,  2 L 

be  the  associated  observed  object  location  distributions.  We  then  let 
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The  quantity  v/^’  is  a measure  of  the  fit  between  the  I object  location 
distribution  in  k^^  stage  processing  and  the  target  location  distribution  for  the 
n target  based  on  processing  the  first  I -1  object  location  distributions. 
Probabilistically,  V^’  ^Ws  an  estimate  of  the  likelihood  that  the  I object  and 
the  n^”  target  are  in  the  same  cell  at  the  k*”  stage.  The  quantity  [g 

an  estimate  of  the  probability  that  the  I object  and  n^k  target  are  one  and  the 


same. 


til  th 

Now  there  are  two  possibilities;  either  the  n target  is  the  same  as  the  I 
object  or  it  is  not.  If  it  is  not,  t^en  there  is  no  reason  to  modify  our  current 
estimate  of  the  location  of  the  n*  target  based  on  XZ  ; if  the  n^k  target  and  the 
Z ^k  object  are  the  same,  then  a composite  revised  estimate  of  the  location  of  the 
n^k  target  should  be  made  combining  the  information  contained  in  the  prior 
distribution  for  the  location  of  the  n^k  target  with  the  information  contained  in 
the  Z ^'k  object  location  distribution. 


In  mathematical  terms,  using  Bayes'  theorem  we  then  obtain 

o(k,  Z -l),^.,(k) 

(k.Z),  . f,  (k,Z)i  (k.Z-1).  . (k,  Z)  Pn  ' '' 
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(IV-5) 
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The  expressions  in  equations  (IV-3)  through  (IV-5)  provide  the  promised  recursion. 
One  proceeds  in  this  fashion  iteratively  updating  for  each  object  location  distribution 
associated  with  a given  update.  The  entire  series  of  updates  for  sensor  information 
during  a processing  stage  is  carried  out  following  an  update  for  target  motion  at 
the  beginning  of  each  such  processing  stage. 
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Parallel  processing.  Under  sequential  processing  the  object  location  distributions 
during  a processing  stage  lead  to  a series  of  updates  to  the  target  location  distributions. 
An  alternative  mode  of  processing,  which  we  call  parallel  processing,  would  attempt 
to  perform  a single  update  for  all  such  object  location  distributions  simultaneously. 

To  carry  out  the  processing  of  target  location  distributions  in  parallel  we 
introduce  the  notion  of  an  assignment  function.  This  is  a map  f such  that 

f:{l,  2,  ...,  L}^{1,  2 N), 


where  L is  the  number  of  object  location  distributions  being  processed  in  parallel. 

In  general  an  assignment  function  need  not  be  one-to-one  since  more  than  one  object 
location  might  have  been  generated  by  the  same  target.  In  the  current  presentation, 
for  simplicity,  we  will  assume  that  assignment  functions  are  one-to-one,  so  that  in 
particular  L<  N.  We  then  define 


= the  class  of  all  possible  one-to-one 
assignments  of  stage  objects  to 
targets. 


Parallel  processing  is  then  carried  out  as  follows.  First  set 


L 
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Z=lxcS 


(x)A^  '(X), 


(rv-6) 


where  p , as  defined  earlier,  is  the  location  distribution  of  the  n target 
through stage  processing  and  is  the  I th  object  location  distribution 
observed  during  the  k^^  stage.  Next  normalize  to  obtain 
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The  quantities  w^^)  given  in  equation  (IV-7)  are  direct  analogues  of  the  quantities 
“n  ’ ^ defined  by  equation  (IV-4)  except  that  whereas  in  sequential  processii^  object 
locations  were  matched  with  targets  one  at  a time,  under  parallel  processing  a 
simultaneous  assignment  of  all  objects  to  targets  Is  made. 
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The  update  of  the  target  locations  is  then  based  on  the  following  relationship: 


(X) 


(k-1)  (k) 
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where  f*  is  the  inverse  function  of  f,  and 


A 


(k) 

f*(i) 


= 1. 


if  f*(i)  is  not  defined. 


Considerable  experimentation  has  been  carried  out  using  both  the  sequential  and 
parallel  methods  of  processing.  Experience  has  shown  that  both  approaches 
function  reasonably  well  when  the  actual  locations  of  the  targets  remain  spatially 
separated  so  that  there  is  little  possibility  for  confusion  of  their  identities. 

However,  as  the  number  of  targets  increases  and  correspondingly  the  likelihood 
that  unresolved  object  location  distributions  will  be  generated  also  increases, 
the  assumption  that  the  target  locations  can  be  treated  even  approximately  as 
independent  seriously  degrades  target  location  prediction  performance,  the  problem 
being  more  acute  in  the  case  of  sequential  processing  than  parallel  processing. 

The  specific  symptoms  of  the  breakdown  in  processor  performance  include 
(1)  possible  permutation  of  target  identities  and  (2)  the  doubling  up  of  target 
location  distributions.  The  permutation  of  target  identities  occurs  when  the 
estimated  location  distribution  of  target  T , for  example,  corresponds  closely 
to  the  true  position  of  target  T^,  and  vice  versa.  The  doubling  up  phenomenon 
occurs  when  estimated  location  distributions  on  two  targets  appear  to  be  virtually 
identical  when  in  fact  there  is  only  one  actual  target  whose  position  is  compatible 
with  these  distribution^.  In  such  a case,  for  example,  the  computed  location 
distributions  of  both  targets  T and  T^  may  accurately  fit  the  actual  location  of 
target  T^  while  target  T^  is  actually  off  in  another  part  of  the  operating  area 
with  a location  not  covered  by  any  of  the  computed  target  location  distributions. 

The  two  phenomena  of  target  identity  permutation  and  doubling  up  are  related 
in  that  one  may  load  to  the  other;  both  are  extremely  damaging  to  proper  prediction 
performance.  Typical  examples  Illustrating  this  kind  of  anomalous  behavior  using 
ASWIPS  will  be  given  in  the  next  section. 

In  summary,  ir  work  has  shown  that  it  appears  impossible  to  obtain  a completely 
satisfactory  information  processing  model  based  on  the  assumption  of  target  location 
independence.  .As  a result  we  have  developed  a third  form  of  processing,  called 
Extended  Memory  processing,  which  is  based  on  the  key  observation  that  while  the 


target  locations  are,  in  general,  not  incicpendent,  they  are  conditionally  independent 
given  a specific  assignment  of  all  past  sensor  responses  to  the  targets.  This  simply 
says  that  if  one  knew  which  object  location  distributions  (or  more  fundamentally 
which  raw  sensor  responses)  were  caused  by  which  targets,  i,  e, , if  one  knew  the 
correct  response  pattern  to  target  association,  then  the  correlation  in  target 
locations  induced  by  such  observational  data  would  disappear.  In  general,  the 
correct  such  association  is  not  known.  However,  the  target  locations  can  be 
effectively  uncoupled  by  first  conditioning  on  a possible  response  pattern  to  target 
association,  then  performing  a conditional  update  of  target  locations  based  on  such 
an  association,  and  finally  removing  the  conditioning  by  averaging  over  all 
possible  such  associations. 

Observe  that  under  Extended  Memory  processing,  conditional  target  location 
distributions  must  be  retained  in  processor  memory  for  each  association  assigned 
a nonzero  weight.  Extended  Memory  processing  thus  requires  a substantially 
expanded  memory  capability  over  the  other  processing  algorithms  and  this,  as 
we  shall  see  in  the  next  section,  principally  accounts  for  its  improved  prediction 
capability. 

We  remark  that  Extended  Memory  processing  is  a form  of  generalization  of 
parallel  processing.  Under  parallel  processing  the  conditioning  is  based  on  an 
assignment  to  the  targets  of  the  response  patterns  observed  during  a single  update 
period.  Under  Extended  Memory  processing  the  conditioning  is  based  in  effect 
on  a simultaneous  assignment  to  the  targets  of  all  response  patterns  observed  to 
date. 

The  mathematical  description  of  Extended  Memory  processing  is  somewhat 
more  Involved  than  that  of  sequential  or  parallel  processing.  Thus  a detailed 
theoretical  development  of  Extended  Memory  processing  is  relegated  to  Appendix 
B which  also  includes  some  comments  on  the  computing  and  memory  storage 
requirements  imposed  by  this  method  of  processing. 


Comparison  of  Processing  Algorithms  Using  ASWIPS 

This  section  will  be  devoted  to  a series  of  numerical  examples  and  comparisons 
of  the  various  updating  algorithms  we  described  in  the  preceding  section.  All  such 
numerical  results  have  been  obtained  using  the  computer  model  ASWIPS. 

Example  1.  In  our  first  example  we  assume  that  there  are  three  targets 
operating  in  a 10  x 10  cellular  grid  according  to  the  motion  model  described 
earlier  under  which  during  a given  update  stage  a target  is  equally  likely  to 
move  from  its  current  cell  to  any  one  of  the  adjacent  cells. 

The  assumed  target  tracks  are  shown  in  Figure  IV-1  and  the  prior  distributions 
for  the  three  target  locations  are  shown  in  Table  IV-1  below.  Circled  target  locations 
in  Figure  fV-l  indicate  target  detections  at  those  locations.  We  assume  that  near 
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FIGURE  IV-1 


ACTUAL  TARGET  TRACKS 
(Example  1) 


Note:  k indexes  update  stages  win  the  corresponding  target  locations  as  indicated. 
Circled  target  locations  indicate  detections 
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table  IV-1  (continued) 


the  end  of  the  first  update  stage,  two  contacts  are  observed:  one  in  the  cell  (5, 5) 
and  a second  in  the  cell  (6, 6).  The  sc^scrs  involved  are  assumed  to  have  very 
short  range  and,  therefore,  a contact  guarantees  the  presence  of  a target  in  the 
cell  containing  the  sensor.  Thus  two  object  location  distributions  are  observed 
durii^  stage  1:  the  first  placing  a target  with  unit  mass  in  cell  (5,  5),  the  second 
placing  a target  in  the  cell  (6, 6). 

After  an  update  for  motion  during  the  first  stage  and  processing  for  the  first 
stage  contacts,  the  posterior  target  locations  under  Extended  Memory  processing 
are  as  shown  in  Table  IV -2. 

One  observes  from  Table  rV-2  that  both  of  the  contacts  are  unresolved.  The 
contacts  in  cells  (5, 5)  and  (6,6)  could  each  have  been  caused  by  any  of  the  three 
targets.  Consequently,  Table  IV-2  shows  a sharp  concentration  of  target  mass 
in  cell  (5,  5)  in  the  location  distribution  of  each  target  and  a similar  peak  in  cell 
(6,6). 

We  now  turn  to  the  second  processing  stage.  We  assume  that  during  this 
stage  a single  contact  was  obtained  in  cell  (3,  8).  Referring  to  Table  IV-2  one 
observes  that  even  allowing  for  target  motion  only  T^  could  have  caused  this  contact, 
and  therefore  it  is  known  with  certainty  that  T is  located  in  cell  (3,  8)  at  the  end 
of  the  second  update.  (Again  we  assume  that  Qie  contact  occurred  at  or  near  the 
end  of  the  update  period. ) Now  it  becomes  possible  to  fit  some  pieces  of  information 
together.  T^  is  known  to  bv,  in  cell  (3,  8).  Since  T could  have  moved  at  most  one 
cell  since  the  last  update,  T could  not  have  caused  either  of  the  contacts  in  cells 
(5, 5)  or  (6,6),  Thus,  virtually  all  of  the  ambiguity  in  the  problem  has  retroactively 
been  resolved  by  the  latest  contact.  In  particular,  T^  probably  caused  the  contact 
in  cell  (5,  5),  and  T^,  the  contact  in  (6,6),  although  there  is  a small  chance  of  the 
alternative  assignment.  Allowing  for  one  update  period  of  additional  target  motion 
the  locations  of  the  three  targets  at  the  end  of  the  second  update  period  must  be 
as  shown  in  Table  IV-3. 

As  noted  earlier,  this  example  is  based  on  the  use  of  Extended  Memory  processing 
in  performing  the  updates  for  sensor  contacts.  Substantially  different  results  are 
obtained  using  either  sequential  processing  or  parallel  processing.  These  results 
are  shown  in  Table  IV-4  and  IV -5.  One  observes  that,  while  in  all  three  tjqjes  of 
processing,  perfect  localization  is  obtained  on  target  T^,  the  degree  of  localization 
on  the  other  two  targets  depends  on  the  method  of  processing,  and  the  correct 
degree  of  localization,  as  supported  by  the  contact  data,  is  given  only  by  Extended 
Memory  processing. 


1 


Example  2.  Figure  IV-2  below  shows  the  actual  track  of  two  targets  which  form 
the  basis  of  our  second  example.  As  in  Example  1,  the  circled  target  locations 
correspond  to  sensor  contacts  on  that  target.  Thus  in  the  example  there  were  five 
contacts  on  target  T , one  each  for  five  successive  update  periods.  In  contrast 
there  is  only  one  contact  on  target  T , that  occurring  during  the  first  processing 
stage.  Once  again  we  assume  that  aU  sensor  contacts  occurred  sufficiently  close 
to  the  end  of  an  update  stage  so  that  subsequent  target  motion  during  that  stage  is 
negligible. 
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TABLE  IV-2  (continued) 


T LY. " 3 (continu e d )_ 
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TARGET  LOCATION  DISTRIBUTIONS  AFTER  2 UPDATES 
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FiQIl^EJJLl.2. 

ACTUAL  TARGET  TRACKS 
(Example  2) 

Note:  k indexes  update  stages  with  the  corresponding 
target  locations  as  indicated. 

Circled  target  locations  indicate  detections. 


We  assume  that  at  the  initiation  of  processing  essentially  nothing  is  known 
about  the  prior  locations  of  the  two  targets.  Consequently,  it  is  assumed  that  the 
prior  location  distribution  of  each  target  is  uniform  over  the  10  x 10  cell  grid 
with  each  target  initially  having  probability  , 01  of  being  in  any  cell. 

As  shown  in  Figure  IV-2,  contacts  are  observed  on  each  of  the  two  targets  during 
the  first  update  stage:  one  placing  a target  in  cell  (7,  3)  and  the  other  placing  a 
target  in  cell  (4,  8),  Because  the  prior  target  location  distributions  are  uniform, 
there  is  no  basis  on  which  to  show  a preference  in  deciding  which  target  caused 
which  response.  As  a result,  under  either  parallel  processing  or  Extended  Memory 
processing,  , 5 of  the  mass  of  each  target  would  be  assigned  to  each  of  the  critical 
cells  after  processing  these  two  contacts.  Sequential  processing,  however,  would 
Introduce  an  artificial  asymmetry  into  the  problem  and  assign  a total  of  somewhat 
more  than  a unit  target  mass  to  the  cell  containing  the  second  contact  processed 
and  correspondingly  somewhat  less  than  a unit  mass  to  the  cell  containing  the  sensor 
contact  processed  first.  This  behavior  is  simply  another  intrinsic  flaw  in  sequential 
processing. 

Table  IV -6  shows  the  target  location  distributions  after  the  fifth  processing 
update  using  the  Extended  Memory  processing  technique.  One  first  observes  that  the 
location  distributions  for  targets  T^^  and  T^  are  identical,  as  in  fact  they  should  be 
in  view  of  the  complete  symmetry  in  the  nature  of  our  information  about  their  locations. 
A contact  was  observed  in  cell  (4,  3)  at  the  end  of  the  fifth  update  period  and  Table  rV-6 
shows  that  this  contact  has  been  allocated  equally  to  the  two  targets.  No  responses 
have  been  observed  on  target  T since  the  first  processing  stage.  Thus  each  target 
map  shows  a second  mode  which  is  quite  diffuse  in  contrast  to  the  mode  induced  by 
the  contact  in  cell  (4,  3). 

An  overall  assessment  is  that,  at  the  end  of  the  fifth  update,  a target  (we  do  not 
know  which  one)  can  be  placed  with  certainty  in  cell  (4,3).  In  addition,  there  appears 
to  be  a second  target  somewhere  in  the  general  northeastern  portion  of  the  operating 
area.  One  concludes  that  the  results  produced  by  Extended  Memory  processing  are 
quite  consistent  with  the  underlying  sensor  contacts  and  appear  to  represent  about 
the  best  locaiization  that  the  available  information  supports. 

Table  rV-7  shows  the  analogous  results  for  Example  2 based  on  parallel  processing 
after  the  fifth  update  (the  results  under  sequential  processing  are  quite  similar).  Again 
the  two  target  location  distributions  are  identical  because  of  the  basic  symmetry  in  the 
problem.  In  addition,  each  distribution  shows  a sharp  mode  in  cell  (4,  3)  corresponding 
to  the  recent  contact  there,  although  parallel  processing  predicts  an  expected  number 
of  targets  of  approximately  1. 07  in  contrast  to  the  1. 00  predicted  by  Extended  Memory 
processing. 

The  most  dramatic  difference  between  Tables  IV -6  and  IV-7,  however,  lies  in 
the  distribution  of  mass  outside  cell  (4,  3).  Parallel  processing  predicts  that  a second 
target  is  likely  to  be  found  in  the  general  vicinity  of  cell  (4,  3)  with  extremely  little 
likelihood  given  to  the  possibility  that  either  of  the  two  targets  is  to  be  found  in  the 
eastern  half  of  the  operating  region. 
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TABLE  IV-6  (continued) 
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Example  2 demonstrates  what  we  have  come  to  refer  to  as  the  quiet  target 
problem.  The  quiet  target  problem  arises  from  the  propensity  of  an  Information 
processor  employing  either  sequential  or  parallel  processing  to  overreact  to  a 
string  of  responses  on  a noisy  target  spanning  a number  of  update  stages  when, 
at  the  outset,  the  initial  location  distribution  for  that  target  had  significant  overlap 
with  the  location  distributions  of  one  or  more  other  targets  that  did  not  generate 
sensor  responses  during  the  same  period.  The  overreaction  that  takes  place  is 
the  gradual  buildup  in  the  area  of  the  contact  string  of  the  probability  mass  not 
only  of  the  target  that  is  actually  causing  the  contacts,  but  also  of  all  other  "quiet" 
targets  that  initially  had  some  chance  of  being  the  responsible  target.  The  net 
result  is  that  the  processor  attempts  to  explain  the  string  of  contacts  by  placing 
with  virtual  certainly  the  locations  of  all  of  the  targets  it  can  in  the  area,  rather 
than  by  simply  attributing  the  chain  of  contacts  to  a single  target. 

Exactly  this  sort  of  situation  typically  arises  in  applications  when  one  target 
is  intrinsically  more  noisy  than  another.  In  this  case  the  noisier  one  tends  to 
produce  a higher  frequency  of  contacts  and  eventually  the  doubling  up  phenomenon 
described  earlier  takes  place.  This  means  that  little  or  no  overall  probability 
is  assigned  to  the  area  near  the  quieter  target.  Eventually  this  quiet  target, 
however,  will  cause  a contact,  in  which  case  the  decision  of  the  processor  about 
the  Identity  of  the  quiet  target  may  be  driven  by  extremely  small  tail  probabilities 
in  the  distributions  of  a number  of  targets  whose  actual  locations  may  be  far 
from  the  location  of  the  contact.  The  result  is  a very  dramatic  shift  in  the  predicted 
location  of  such  targets.  This  then  is  a typical  source  of  the  permutation  problem 
which  we  also  described  earlier. 

In  short,  it  should  be  clear  from  our  two  examples  that  Extended  Memory 
processing  shows  substantially  improved  target  location  prediction  performance 
over  the  other  two  processing  algorithms.  Future  developmental  work  on 
Extended  Memory  processing  will  concentrate  on  generalizing  its  use  to  include 
negative  information  and  false  targets.  ] 
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APPENDIX  A 


AUXILIARY  FORMULAS  FOR  EVALUATING  GAUSSIAN  INTEGRALS 


This  appendix  gives  the  formulas  of  multivariate  Gaussian  analysis  and  matrix 
relations  which  are  needed  in  the  analysis.  Reference  [h]  is  a standard  reference 
for  the  Gaussian  analysis. 


Expressions  for  Multivariate  Gaussian  Quantities 

The  multivariate  Gaussian  density  function  on  N-dimensional  space  is 


f(x)  - k exp  -|{x'P“^x} 


(A-1) 


where  P is  the  covariance  matrix  of  x,  and  k is  a constant  so  chosen  that  the 
integral  of  f overR^  equals  1.  It  is  well  known  that 


T ^ f N -Hx'P"^x}dx  = (27r)^/^ 
R 


(A -2) 


where  |p|  is  the  determinant  of  P.  Clearly,  it  is  true  that  for  any  N-vector  d, 

f N exp -n(x-d)'P"^(x-d)}dx  = f ^ exp -Hx'P'^x}  dx,  (A-3) 

R R 

as  can  be  seen  by  a simple  change  of  variables. 


Matrix  Identities 


Completing  the  square.  The  first  one  is  known  as  completing  the  square.  Let 

F(z,y,CT)  = (z-y)'  A(z-y)  + (z-o)'  B(z-a) 

be  a second  order  homogeneous  relationship  in  z where  A is  a positive  definite 
matrix  and  B is  a positive  semi-definite  matrix. 

We  claim  that 

FlZjy.cx)  = (z-6)'  C(z-6)  + (y-a)'  D(y-CT) 


where 


C = A+B, 

D = B-B(A+B)'1b 
= A-A(AiHB)"^A 
= A(A+B)"1b, 

6 = C"^(Ay+Ba) 

= y + (A+B)"^B((7-y ). 

In  order  to  establish  the  relationship,  note  that 

F(z,y,ff)  (z-6)’C(z-6 )+L 

where  L is  independent  of  z.  Setting  z = 6 we  then  have 

L = (6-y)'  A(6 -y)  + (6 -ct )'B(6 -a) 

- {y-a)'  B(A+B)”^  A A“^  A(A+B)~^  B(y-ff) 
+ {y-a)'  A(A+B)"^  B(A+B)"^  A(y-a). 


A -2 


But  observe  that 


B(A  + B)"1  A = (I-A(A+B)"1)  A 
= A - A(A+B)~1  A 
= A(A+B)~1b, 


whence  we  obtain,  as  claimed, 


L = (v-cr)'  A(A+B)~1(B+B  A~1b)  (A+B)"^  A(y-CT) 
= (y-a)' A(A+B)"1B(I+A"1b)  (I+A"lB)"l(y-(T) 


- (y-ff)'  A(A+B)~^B(y-CT). 


In  particular,  if  B is  invertible,  then 


L = (y-a)'  (A~l+B"l)~l  (Y-ct). 


Inverse  of  a partitioned  matrix.  Let  a matrix  M and  its  inverse  be 
partitioned  into  submatrices 


A I B 


B'  I C 

I 


D i E 


E'  I F 


where  the  dimensions  of  A and  D are  the  same,  but  possibly  different  from  the 
dimensions  of  C and  F.  Define  auxiliary  matrices  S and  T by 


S - A"^  B, 


T = C - B'S. 


(A-10) 


(A-ll) 


(A-12) 


Then 


D - a'^  + ST'^S’, 
E = -ST“\ 

F = t”\ 


(A -13) 
(A -14) 


(A-15) 


Note  also  that 


M = A / F - A 


(A -16) 


Equations  (A-13)  to  (A-16)  can  be  used  directly  to  find  and  | M ^ | given 
A“l,  B,  C,  and  |a|  . Alternately,  given  M“1  (i.e.  , D,  E,  F)  and  | I these 
same  relations  can  be  solved  to  yield  ( A | and  A“^.  For  this  latter  purpose, 
observe  from  (A-14)  and  (A-15)  that 


so  that 


T = F 


S = -EF 


a“^  = D-E(EF'S'  - D-EF"^E', 


(A -17) 


= 1/  mI  • F = m‘  / F . 


(A-18) 


Dimension  reduction  . Let 


= (y-a)'M  (y-a) 


(A -19) 


where  M and  a are  given.  Using  the  partitioning  of  (A-9),  let  M,  M , a,  and  y 
be  partitioned  as; 


D I E 


E'  ! F 


B'  j C 


. a = - 


where  the  dimensions  of  D,  A,  rv,  and  y^  are  compatible.  Then  (A-19)  can  be 


A-4 


,#(y)  " C -O)'  2:  V -CT)+  T (A-20) 

a 0/ 


where 

Z = D - EF  *E', 

G = 0 + EF“^  (y^  - p), 

T = R + (y^-  f"^  (y^-/S). 

I 2 I = I M I /(  f|  . 

These  are  all  in  terms  of  entries  in  M and  a. 

The  point  of  recasting  (A-19)  into  the  form  {A-20)  is  to  reduce  the  dimension  of 
the  variable  y (prior  to  integrating  over  y ).  Observe  that  a is  linear  and  t is  quadratic 

iny.-  ""  " 

/3 

Another  matrix  inverse  result.  * Let  a matrix  A and  its  inverse  B = A be  partitioned 
into  submatrices  as  follows: 


(A -21) 
(A -22) 
(A-23) 

(A-24) 


1 


(A+A)"^  = (A(I  + a"^A))“^  = (I+BA)'‘a~\ 


(A -2  5) 


where  I is  the  identity  matrix.  Furthermore, 


(I  + BA)  = 


^11  ®12^22  ^ 

° ^22^®22'^22  ® 


(A -2  6) 


B A 
32  22 


33 


where  and  are  identity  matrices. 


Let 


(A -2  7) 


By  direct  evaluation 


(I  + BA) 


-1 


P I , 


so  that 


A + A 


-1 


P \ • I B . 


(A -2  8) 


The  inverse  of  (/3)  is,  by  inspection 


(I+A"^A)'^  =1  0 


= I + 


Now 


Hence, 


P - I, 


22 


^^22"^  ~®22'^22^' 


B A ft 
12  22^ 


B A ft 
22  22^^ 

B A ft 
32  22^^ 


”^12^22^ 

^"^22 

■^32^22^ 


^^^^22^  ■ ^^^2^®22  22^'^22^  ^ " ^^22  ^®22^  ^ so  that  (A -2 5)  becomes 


M 


A PPENDIX  B 


EXTENDED  MEMORY  PROCESSING  METHODOLOGY 


In  this  appendix  we  develop  the  mathenaatical  theory  underlying  the  information 
updating  algorithm  we  have  called  Extended  Memory  processing. 

As  the  examples  in  the  fourth  section  of  Chapter  IV  clearly  demonstrated,  a 
Bayesian  updating  scheme,  assuming  target  location  independence  in  a multi-target 
environment  and  operating  solely  on  the  marginal  target  location  distributions, 
generally  runs  into  serious  trouble  when  confronted  with  unresolved  sensor  responses. 

A sensor  response  (or  response  pattern  when  preprocessing  is  employed)  is  said 
to  be  resolvable  if  one  can,  at  least  in  a Bayesian  statistical  sense,  infer  uniquely 
the  identity  of  the  target  that  generated  the  response.  In  ASW  applications  the 
resolvable  responses  may  be  limited  to  port  arrivals  and  departures  and  any  sen- 
sor response  such  that  the  prior  target  location  distributions  at  the  time  of  the 
response  place  only  one  target  within  possible  detection  range  of  the  associated 
sensor(s). 

In  the  first  section  below  we  introduce  the  notion  of  a response  pattern -to -target 
association  distribution  and  describe  the  use  of  such  distributions  in  updating  target 
location  distributions.  The  updating  of  the  response  pattern -to -target  association 
distributions  themselves  is  described  in  the  second  section. 

Response  Pattern-to-Target  Association  (Scenario)  Distributions 

We  begin  by  indexing  the  response  patterns  Cj,  C2,  • • . in  order  of  processing 
(usually  approximate  chronological  order).  Included  in  this  list  are  all  target  port 
departures  and  arrivals.  Also  index  the  targets  Tj,  T2, ...  in  order  of  port  departure. 
We  assume  here  that  all  contacts  are  on  valid  targets.  The  generalization  to  false 
targets  is  reasonably  straightforward. 

We  next  define  the  random  variables 

Nj  = index  of  the  target  that  caused  response  pattern 
Cj,  i = 1,2,  ... 


and  the  random  vector 


S 

i 


(Ni.  N^. 


N.). 


Suppose  for  the  moment  that  the  Nj  were  not  random  variables  but  deterministically 
known  at  the  time  each  response  pattern  was  reported  to  the  processor.  Assume,  for 
example,  that  the  radiated  noise  characteristics  of  the  various  targets  are  sufficiently 
different  to  permit  this  degree  of  target  identification.  The  target  coupling  problem 
would  then  completely  disappear  since  the  target  location  distributions  updated  for 
both  motion  and  responses,  but  conditioned  on  a specific  association  of  responses  with 
targets,  would  be  conditionally  mutually  independent.  In  making  this  statement, 
we  of  course  assume  that  the  targets  may  be  treated  as  moving  independently. 


Unfortunately,  there  are  many  cases  in  practice  in  which  the  identity  of  the 
target  that  generated  a specific  response  pattern  cannot  be  unambiguously  inferred. 
Nevertheless,  our  statement  about  the  conditional  independence  of  the  target  location 
distributions  given  an  assignment  of  all  past  responses  to  targets  points  the  way 
toward  a possible  solution  to  the  target  coupling  problem.  The  fundamental  idea  in 
Extended  Memory  processing  is  simply  to  produce  a separate  estimate  of  each 
currently  active  target's  location  distribution  for  each  possible  scenario,  i.  e. , for 
each  specific  assignment  of  all  past  response  patterns  to  targets.  Weighted  averages 
of  these  conditional  location  distributions,  the  weights  being  the  probabilities 
associated  with  each  scenario,  would  then  give  current  estimates  of  each  target's 
(unconditioned)  location  distribution. 

To  make  all  of  this  more  precise  requires  some  work.  The  first  step  is  to 
define  the  probability  distribution  on  scenarios  after  processing  the  i^^  joint  response 
pattern: 


"2’ 


,n.)  = Pr{Nj=nj,  = n^. 


(B-1) 


We  suppose  that  by  one  means  or  another,  estimates  of  the  F.  are  available  for  each 
index  i.  A Bayesian  procedure  for  generating  such  estimates*  will,  in  fact,  be  described 
in  the  next  section.  We  now  describe  a recursive  procedure  for  developing  posterior 
estimates  of  (i.  e. , updating)  the  target  location  distributions.  This  procedure  makes 
use  both  of  the  current  distribution  on  scenarios  and  the  prior  target  location  distri- 
butions. 


Associated  with  each  response  pattern  index  i,  there  is  the  index  kj  of  the 
information  processing  update  stage  at  which  Cj  is  processed.  In  current  ASWIPS 
processing,  kj  is  simply  the  index  of  the  first  update  following  the  occurrence  of  Ci. 
Also  define  i^  to  be  the  index  of  the  last  response  pattern  processed  through  the 
end  of  the  k*^^  stage.  Thus,  Cij^+2»  •••  » ^ij^+j  is  the  sequence  of  response 


If  none  of  the  indices  ni^+i , is  equal  to  n,  then  Pjj  (•  | Sijj  = (ni,n2 njjj)) 

I Sik+1  = (ni.n2 nik.nik+l .. . . ,nik+i)).  If  one  or  more  of  these  indices  is 

equal  to  n,  then  denote  the  ordered  list  of  such  indices  by  , ‘'rSn'  > 

the  given  association  assigns  to  target  n precisely  the  m response  patterns 


C ^ in\  * • • • » C 


out  of  those  observed  between  the  and  (k+1)^^  updates. 


^v,^n)*  •••• 


be  the  object  location  distributions  corresponding  to  the  response  patterns 


^ (n) » ^ (n)  > • • • » C (n) 

V i ' V A V 

1 2 m 


VV' 
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\(n)(x)  j • I S.^  = {n^,n^ n^)^ 

\/n)  O'*)'  Pr  I \ ' \>) 


Equation  (B-4)  gives  the  updating  formula  for  the  conditional  target  location  dis- 
tributions. It  remains  to  show  how  the  unconditioned  target  location  distributions  are 
obtained.  For  this  we  define 

p^^  (x)  = Pr  {target  n is  in  cell  x at  time  of  stage  update}  , 
n 

where;  as  in  the  conditional  target  location  distributions,  it  is  to  be  understood  that 
the  pjj^^  reflect  target  motion  through  the  end  of  the  k^^  update  as  well  as  all  response 
patterns  through  Cj[j^ . Then,  letting  ey!  |^  denote  the  set  of  possible  k-vector  values 


of  S.  , we  set 


pf  >'  (X, . V 

" (n  .n  )e/Vk+l 

^ ^ Hc+1 


F;  (rij.n  ,...,n  "• 

k+1  k+1  ^k+1 

(B-5) 


Thus,  the  unconditioned  target  location  distribution  for  target  n is  an  overlay  of 
the  conditional  target  location  distributions  weighted  by  probabilities  of  the 

corresponding  scenarios.  Observe  that  equation  (B-5)  has  the  effect  of  relating  the 
current  assessment  of  the  location  of  target  n to  the  scenario  weights  not  just  of  the 
latest  round  of  response  patterns,  Cjj^+i, . . . but  to  those  of  all  past  resix)nse 

patterns.  Thus,  changes  at  the  (k+l)st  update  in  the  scenario  weights  of  response 
patterns  first  recorded  at  earlier  updates  can  have  significant  impact  on  the 
stage  target  location  distributions.  This  memory  feature  is  an  essential  advantage 
of  Extended  Memory  processing  over  both  sequential  processing  and  parallel  rrocessing. 


> Before  turning  to  the  Bayesian  updating  of  the  scenario  distribution,  a few 

i comments  are  in  order  on  the  computational  feasibility  of  the  target  location  distribution 

[ updating  procedure  we  have  just  described. 


i It  would  appear  superficially  that  the  number  of  conditional  target  location 

I distributions  which  must  be  stored  in  computer  memory  and  updated  at  stage  k is  on 

I the  order  of  N • 2^*^,  where  N is  the  number  of  active  targets  at  stage  k and,  as 

I defined  earlier,  ij^  is  the  number  of  response  patterns  processed  through  stage  k. 

t Since  ij^  grows  more  or  less  linearly  with  time  (assuming  a roughly  constant  data 

rate),  2^*^  can  become  quite  large  for  a lengthy  processing  period.  With  N - 7 and  an 
i average  of  2 response  patterns  per  update,  N • 2^^^  = 7 • 2^*^®  after  50  updates.  This 

I number  is  even  larger  than  the  (1000)'  elements  of  the  state  space  associated  with 

I the  multivariate  joint  location  distribution  for  7 targets  in  a grid  of  1000  cells,  making 

i the  situation  look  grim,  indeed. 

Fortunately,  the  situation  in  ASW  application  is  not  nearly  as  bad  as  the  exlreme 
we  have  described.  First  af  all,  many  response  patterns  are  resolvable,  i.  e. , uniquely 
assignable  to  an  identifiable  target.  Those  response  patterns  which  are  unresolved 
generally  are  associable  with  one  of  two  targets  or,  perhaps  on  rare  occasions, 
with  one  of  three  or  four  targets.  Another  factor  is  that  it  frequently  happens  that 
even  when  a response  pattern  Cj  is  unresolved,  it  is  uniquely  associable  with  an 
earlier  response  pattern  Cj(i  < j)  in  the  sense  that  C|^  (within  the  accuracies  of  Bayesian 
inference)  must  have  been  caused  by  the  same  target  (whatever  its  true  identity)  that 
caused  Cj.  This  situation  arises  when  P'j(. . . , nj, . . . , nj)  = 0 for  nj  / nj.  In  such 
a case  Cj  is  more  or  less  redundant  and  can  be  effectively  dropped  (by  summing  on  nj) 
from  the  list  of  response  patterns  in  favor  of  Cj. 


B-5 


I 


A final  factor  in  reducing  the  dimensionility  of  the  problem  relates  to  target 
motion.  Suppose  response  pattern  C.  was  observed  and  processed  at  time  Then 
as  time  passes,  the  current  location  distribution  of  the  target  that  generated  Cj  diffuses 
until  a point  in  time  is  reached  at  which  that  distribution  is  spread  over  such  a large 
part  of  the  target  operating  region  that  its  usefulness  in  target  localization  is  virtually 
nil.  At  this  point  C.  can  reasonably  be  discarded  from  the  list  of  active  response 
patterns  (again  by  summing  over  n^  in  the  current  scenario  distribution). 

To  put  all  of  this  in  perspective,  consider  a situation  with  7 active  tai  gets  in 
which  only  the  last  8 response  patterns  are  carried  in  the  scenario  distribution,  say 
Cj,  C2,  ....  Cg.  Suppose  C^,  C7,  and  Cg  are  resolvable  with  N4  = 3,  N7  = 6,  and 
Ng  ==  1.  In  addition,  assume  Cj,  C2,  and  Cg  are  unresolved  relative  to  T2  and  T.. 
Finally  suppose  that  Cg  and  Cg  are  unresolved  relative  to  T4  and  T7  but  that  Cg  and 
Cg  are  necessarily  due  to  different  targets.  The  number  of  conditional  target  location 
distributions  carried  in  computer  memory  would  then  be:  3 + 2 • 2'^  ■*  2 • 2 - 23,  which 
is  a far  cry  from  7 • 2®  = 1,  792.  While  this  is  only  a representative  case,  it  does 
tend  to  support  the  conclusion  that  the  updating  scheme  we  have  described  is  computa- 
tionally practical. 

We  now  finally  turn  to  the  problem  of  the  Bayesian  updating  of  the  distribution 
on  scenarios. 


The  Bayesian  Updating  of  the  Distribution  on  Scenarios 

We  assume  that  the  first  response  pattern  C'l  is  a port  departure  by  target  T| 
and  that  C|  is  the  only  pattern  processed  at  the  first  update.  These  restrictions 
can  be  modified  to  correspond  to  any  initial  conditions  including  the  allowance  for 
holdover  targets  from  earlier  (prior  to  the  start  of  the  processing  period)  port 
departures.  However,  the  case  Fj^(l)  = 1 corresponds  to  the  current  setup  in 
ASWIPS,  so  we  make  the  stated  specialization. 

1 

; Suppose,  inductively,  that,  the  Fjj^  (n^,  n^, . . . , njj^)  have  been  computed  for  everv 

I (ni,n2,...,ni,^)  . We  now  wish  to  generate  If  i|^ti  ^ i[^,  so  that  no 

response  patterns  were  observed  between  the  and  (k+l)®'-  updates,  then  Fi|, , j l>’jj^, 

their  domains  of  definition,  in  particular,  being  identical.  ' 

i 

Suppose  then  that  ij^  + j > ij^  with  (following  our  earlier  notation)  the  response 
patterns  , Ci|^+2  . • • • » ^i^+i  generated  between  the  k'^'^  and  (kt^l)St  mxiates. 

I Consider  a particular  as.signment  of  these  responses  to  the  active  targets, 

j i.  e.  , as.sume 

i 

N.  = n.  for  i,  < i < i, 

: Ilk-  k+1 


I 


■ 


I 
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